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Abstract 

Adaptive  array  processing  algorithms  have  achieved  widespread  use  because  they  are 
very  effective  at  rejecting  unwanted  signals  (i.e.,  controlling  sidelobe  levels)  and  in 
general  have  very  good  resolution  (i.e.,  have  narrow  mjunlobes).  However,  many 
adaptive  high-resolution  array  processing  algorithms  suffer  a  significant  degradation 
in  performance  in  the  presence  of  environmental  mismatch.  This  sensitivity  to  envi¬ 
ronmental  mismatch  is  of  particular  concern  in  problems  such  as  long-range  acoustic 
array  processing  in  the  ocean  where  the  array  processor’s  knowledge  of  the  propaga¬ 
tion  characteristics  of  the  ocean  is  imperfect.  An  Adaptive  Minmax  Matched  Field 
Processor  has  been  developed  which  combines  adaptive  matched  field  processing  and 
minmax  approximation  techniques  to  achieve  the  effective  interference  rejection  char¬ 
acteristic  of  adaptive  processors  while  limiting  the  sensitivity  of  the  processor  to 
environmental  mismatch. 

The  derivation  of  the  algorithm  is  carried  out  within  the  framework  of  minmax 
signal  processing.  The  optimal  array  weights  are  those  which  minimize  the  maximum 
conditional  mean  squared  estimation  error  at  the  output  of  a  linear  weight-and-sum 
beamformer.  The  error  is  conditioned  on  the  propagation  characteristics  of  the  envi¬ 
ronment  and  the  maximum  is  evaluated  over  the  range  of  environmental  conditions  in 
which  the  processor  is  expected  to  operate.  The  theorems  developed  using  this  frame¬ 
work  characterize  the  solutions  to  the  minmax  array  weight  problem,  and  relate  the 
optimal  minmax  array  weights  to  the  solution  to  a  particular  type  of  Wiener  filtering 
problem.  This  relationship  makes  possible  the  development  of  an  efficient  algorithm 
for  calculating  the  optimal  minmax  array  weights  and  the  associated  estimate  of  the 
signal  power  emitted  by  a  source  at  the  array  focal  point.  An  important  feature  of 
this  algorithm  is  that  it  is  guarenteed  to  converge  to  an  exact  solution  for  the  array 
weights  and  estimated  signal  power  in  a  finite  number  of  iterations. 
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The  Adaptive  Minmax  Matched  Field  Processor  can  also  be  interpreted  as  a  two- 
stage  Minimum  Variance  Distortionless  Response  (MVDR)  Matched  Field  Processor. 
The  first  stage  of  this  processor  generates  an  estimate  of  the  replica  vector  of  the  signal 
emitted  by  a  source  at  the  array  focal  point,  and  the  second  stage  is  a  traditional 
MVDR  Matched  Field  Processor  implemented  using  the  estimate  of  the  signal  replica 
vector. 

Computer  simulations  using  several  environmental  models  and  types  of  environ¬ 
mental  uncertainty  have  shown  that  the  resolution  and  interference  rejection  capabil¬ 
ity  of  the  Adaptive  Minmax  Matched  Field  Processor  is  close  to  that  of  a  traditional 
MVDR  Matched  Field  Processor  which  has  perfect  knowledge  of  the  characteristics 
of  the  propagation  environment  and  far  exceeds  that  of  the  Bartlett  Matched  Field 
Processor.  In  addition,  the  simulations  show  that  the  Adaptive  Minmax  Matched 
Field  Processor  is  able  to  maintain  it’s  accuracy,  resolution  and  interference  rejection 
capability  when  it’s  knowledge  of  the  environment  is  only  approximate,  and  is  there¬ 
fore  much  less  sensitive  to  environmental  mismatch  than  is  the  traditional  MVDR 
Matched  Field  Processor. 

Thesis  Supervisor:  Alan  V.  Oppenheim 

Title;  Distinquished  Professor  of  Electrical  Engineering 

Thesis  Supervisor:  Arthur  B.  Baggeroer 
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Chapter  1 
Introduction 


The  signals  received  by  spatial  arrays  of  sensors  are  often  composed  of  the  sum  of 
signals  emitted  by  sources  at  different  locations.  In  order  to  estimate  the  signal,  or 
the  parameters  of  the  signal,  emitted  by  a  source  at  a  particular  location,  the  array 
processor  must  often  separate  that  signal  from  the  other  signals  which  are  received. 
This  separation  of  signals  based  upon  the  location  of  the  source  is  referred  to  as 
spatial  filtering.  Thus,  the  spatial  filtering  of  signals  received  by  an  array  of  sensors 
to  generate  estimates  of  the  parameters  of  the  signals  emitted  by  sources  at  locations 
of  interest  is  an  important  operation  in  many  array  processing  applications. 

Array  processors  achieve  spatial  discrimination  through  filtering  by  exploiting  the 
fact  that  the  spatial  characteristics  of  a  propagating  signal  as  received  at  an  array  of 
sensors  depend  upon  the  location  of  the  source  of  the  signal.  However,  the  spatial 
characteristics  of  a  propagating  signal  also  depend  upon  the  characteristics  of  the 
medium  through  which  the  signal  is  propagating.  Therefore,  if  a  processor  has  in¬ 
accurate  or  incomplete  information  concerning  the  characteristics  of  the  propagation 
environment,  it  may  be  unable  to  determine  the  spatial  characteristics  which  should 
be  exhibited  by  a  signal  emitted  by  a  source  at  the  location  of  interest.  In  this  case, 
the  processor  may  have  difficulty  in  accomplishing  the  spatial  filtering  necessary  to 
estimate  the  parameters  of  the  signal  of  interest.  This  work  proposes  an  approach  to 
array  processing  which  yields  a  processor  capable  of  operating  with  only  approximate 
environmental  information  while  at  the  same  time  achieving  levels  of  spatial  discrim- 
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ination  which  are  close  to  those  achieved  by  adaptive  processors  having  accurate  and 
detailed  environmental  information. 

The  remainder  of  this  chapter  contains  general  background  information  on  array 
processing.  Section  1.1  discusses  general  linear,  adaptive,  and  matched  field  pro¬ 
cessing.  Section  1.2  describes  a  parameterization  of  the  spatial  characteristics  of 
propagating  signals,  which  is  useful  for  the  class  of  algorithms  considered  herein.  The 
problems  which  array  processors  exhibit  when  the  environmental  information  is  in¬ 
accurate,  and  possible  approaches  to  developing  processors  which  are  able  to  operate 
effectively  with  inaccurate  or  imprecise  information  are  reviewed  in  Section  1.3.  This 
section  also  introduces  the  minmax  signal  processing  approach,  which  is  proposed 
herein  to  address  the  problem  of  array  processing  with  only  approximate  environ¬ 
mental  information. 

The  theoretical  foundations  of  the  minmax  approach,  based  on  the  Minmax  Char¬ 
acterization  Theorm,  are  developed  in  Chapter  2.  This  theorem  sets  forth  the  neces¬ 
sary  and  sufficient  conditions  which  must  be  met  by  any  solution  to  a  general  class 
of  minmax  problems.  The  details  of  the  proposed  array  processor,  referred  to  as  the 
Adaptive  Minmax  Matched  Field  Processor,  are  presented.  A  computationally  effi¬ 
cient  algorithm  which  is  guaranteed  to  converge  to  an  exact  solution  of  the  minmax 
optimization  problem  of  interest  is  developed  by  exploiting  the  special  structure  im¬ 
posed  on  the  solution  by  the  Minmax  Characterization  Theorem.  Finally,  an  approach 
to  bounding  the  minmax  performance  achievable  by  any  processor  is  proposed. 

In  Chapter  3,  the  structure  imposed  by  the  Minmax  Characterization  Theorem  is 
again  exploited  to  relate  the  Adaptive  Minmax  Matched  Field  Processor  to  Capon's 
Minimum  Variance  Distortionless  Response  (MVDR)  Matched  Field  Processor  [9, 11]. 
The  relationship  developed  leads  to  a  qualitative  analysis  of  the  processor.  This  anal¬ 
ysis  motivates  a  small  change  to  the  algorithm  developed  in  Chapter  2.  A  quantitative 
analysis  of  the  algorithm  based  on  results  of  numerical  simulations  is  also  presented. 
These  numerical  results  are  generated  for  both  deterministic  time-invariant  and  ran¬ 
dom  time-varying  propagation  environments.  The  results  for  the  latter  case  motivate 
another  small  change  to  the  algorithm  which  is  also  detailed. 
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Chapter  4  addresses  the  problem  of  generating  a  priori  estimates  of  the  spa¬ 
tial/temporal  characteristics  of  a  propagating  signal  as  a  function  of  environmental 
conditions  and  source  location.  This  chapter  does  not  present  original  work.  Instead, 
it  presents  results  developed  by  others  [29,  30,  31,  32,  33,  40,  41]  on  the  propagation 
of  signals  through  random  media  and  outlines  how  this  work  can  be  applied  to  gen¬ 
erating  estimates  of  the  spatial/temporal  signal  characteristics.  Finally,  the  results 
generated  herein  are  summarized  and  future  work  is  discussed  in  Chapter  5. 

1.1  Linear,  Adaptive,  and  Matched  Field  Processing 

Many  types  of  array  processors  either  implicitly  or  explicitly  incorporate  a  linear 
weight-and-sum  beamformer  to  implement  spatial  filtering.  This  filtering  allows  the 
processor  to  discriminate  among  signals  based  upon  the  location  of  the  source  of 
the  signals.  Given  an  input  |/  (the  joint  temporal /spatial  filtering  of  the  sampled 
vector  time  series  j/[n]  will  not  be  considered  in  this  introduction),  the  output  of  a 
linear  weight-and-sum  beamformer  is  x  =  where  jn  is  the  array  weight  vector, 
the  superscript  h  denotes  complex  conjugate  transpose  (i.e.,  Hermitian),  and  x  is  an 
estimate  of  the  signal  emitted  by  a  source  at  a  location  of  interest. 

Linear  beamformers  enjoy  widespread  use  for  several  reasons.  First,  they  generally 
have  the  lowest  computational  complexity  of  the  available  methods  of  implementing 
a  spatial  filter  (given  an  N  element  array,  the  filtering  operation  is  an  (?  (A^)  opera¬ 
tion  and,  if  required,  the  calculation  of  the  array  weights  to  minimize  a  squared  error 
criterion  is  often  an  O  {N^)  operation).  Second,  when  the  array  weights  are  chosen  to 
minimize  a  mean-squared  estimation  error  criterion,  the  solution  for  the  optimal  ar¬ 
ray  weights  is  a  convex  quadratic  minimization  problem  and  is  analytically  tractable. 
Third,  linear  filtering  preserves  the  actual  time-series  of  the  signal  of  interest  which 
is  important  in  many  applications.  Finally,  when  the  received  signal  consists  of  the 
sum  of  a  signal  of  interest  and  interfering  signals,the  spatial  correlation  of  the  inter¬ 
fering  signals  is  different  from  that  of  the  signal  of  interest,  and  the  signal  of  interest 
is  correlated  across  the  aperture  of  the  array,  the  linear  beamformer  is  effective  at 


12 


filtering  out  the  interfering  signals  and  generating  an  estimate  of  th  signal  of  interest. 

Another  class  of  array  processors  which  enjoys  widespread  use  it  the  adaptive 
array  processor.  Adaptive  array  processors  use  observations  of,  or  information  about, 
the  signal,  noise,  and  propagation  environments  to  adjust  the  characteristics  of  the 
processor  to  minimize  or  maximize  some  performance  criterion.  The  processors  are 
able  to  efficiently  use  the  degrees  of  freedom  available  to  the  processor  to  adjust  to 
the  environment  in  which  the  processor  is  operating.  [6]  The  most  widely-used  t)^e  of 
adaptive  processors  are  those  incorporating  adaptive  linear  beamformers  [7,  8].  Two 
such  examples  are  Capon's  MVDR  Processor  [9]  and  the  Applebaum  Beamformer 
[10].  These  processors  use  observations  of  the  combined  signal  and  noise  environment 
to  adaptively  adjust  the  array  weight  vector  to  optimally  pass  the  signal  of  interest 
through  the  filter  and  while  controlling  the  sidelobes  of  the  filter’s  spatial  response  to 
reject  the  interfering  signals  contained  in  the  received  signal.  In  order  to  distinguish 
between  the  signal  emitted  by  a  source  at  the  location  of  interest  from  all  other  signals, 
the  processor  uses  a  priori  estimates  of  the  spatial  characteristics  of  the  signals  of 
interest.  These  a  priori  estimates  depend  upon  the  manner  in  which  the  propagating 
signals  are  modeled. 

Traditionally,  array  processors  have  modeled  propagating  signals  as  plane  waves 
following  a  straight  line  path  from  the  source  to  the  array  of  sensors.  This  corresponds 
to  an  implicit  model  of  the  propagation  medium  as  being  homogeneous  and  infinite 
in  extent  and  the  source  being  far  from  the  array.  The  propagation  of  acoustic  waves 
through  the  ocean  is  not  modeled  accurately  in  this  manner.  Both  the  time-invariant 
and  the  time- varying  temperature,  salinity,  and  pressure  structures  of  the  ocean  are 
spatially- variant.  When  coupled  with  the  finite  extent  (principally  the  finite  depth) 
of  the  oceans,  these  spatially-vaxiant  structures  cause  acoustic  signals  to  propagate  in 
a  manner  which  deviates  significantly  from  that  predicted  by  the  plane- wave  model. 

This  deviation  has  both  adverse  and  advantageous  consequences.  The  adverse 
consequence  is  that,  if  a  plane- wave  model  is  used  by  the  processor,  the  spatial  char¬ 
acteristics  of  the  signal  of  interest  may  not  match  those  estimated  by  the  processor. 
In  this  situation,  which  is  referred  to  as  model  mismatch,  the  processor  may  treat  the 
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signal  of  interest  as  an  interfering  signal  and  attempt  to  reject  it.  A  more  detailed 
discussion  of  this  problem  is  contained  in  Section  1.3.  The  advantageous  consequence 
is  that,  if  the  processor  uses  a  fairly  accurate  environmental  and  propagation  model, 
it  is  possible  to  achieve  source  localization  accuracies  which  far  exceed  those  which 
are  available  in  an  infinte,  homogeneous  medium  [11]. 

A  class  of  processors  which  has  been  developed  to  take  advantage  of  this  improved 
accuracy  and  to  eliminate  the  model  mismatch  problems  caused  by  the  use  of  the 
plane-wave  model  is  referred  to  as  the  Matched  Field  Processor.  First  proposed 
in  [12],  these  processors  use  fairly  complete  environmental  and  propagation  models 
to  make  a  priori  estimates  of  the  spatial  structure  of  received  signals  as  a  function 
of  environmental  condition  and  source  location.  The  processors  use  these  spatial 
structure  estimates  to  operate  on  the  received  sound  field  and  generate  estimates 
of  signal  parameters  of  interest.  The  spatial  structure  of  the  signal  of  interest  is 
parameterized  by  the  signal  replica  vector  as  defined  in  the  following  section. 


1.2  The  Signal  Replica  Vector 

The  signal  replica  vector  is  a  parameterization  of  the  spatial  characteristics  of  a 
propagating  signal  as  a  function  of  the  location  of  the  source  of  the  signal  and  the 
propagation  characteristics  of  the  medium.  Traditionally,  the  signal  replica  vector  is 
defined  for  a  narrowband  signal  propagating  through  a  time-invariant  medium.  In 
this  case,  the  signal  replica  vector  is,  to  within  a  complex  scaling  factor,  a  replica  of 
the  deterministic  narrowband  signal  emitted  by  a  source  at  the  location  z  as  received 
at  the  array  of  sensors.  Thus,  given  that  a  source  at  the  location  ^  emits  the  complex 
exponential  and  the  medium  is  time-invariant,  the  signal  received  by  the  array 

of  sensors  can  be  expressed  as 

1(0  =  B  i,^), 
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where  >4  is  a  complex  random  variable,  £(/,!,  ^  is  the  signal  replica  vector,  ^  is 
a  parameterization  of  the  characteristics  of  the  propagation  environment  and  the 
receiving  array  (e.g.,  the  sound  speed .  profile,  the  depth  of  the  ocean,  the  sensor 
locations,  etc.),  and  B  =  cA  for  some  complex  constant  c  which  depends  on  the 
signal  attenuation  and  progagation  delay  between  the  source  and  the  sensor  array 
and  the  manner  in  which  the  replica  vector  in  normalized.  The  signal  replica  vector 
is  usually  normalized  so  that  its  magnitude  equals  one. 

For  this  work,  the  signal  replica  vector  is  defined  in  a  stochastic  signal  framework 


as 


E[X.(/,2)Xj(/,i)  1^’ 


(1.1) 


where  A’t(/,i)  is  the  discrete-time  Fourier  transform  at  the  frequency  /  of  the  signal 
emitted  by  a  source  at  the  location  z  as  received  at  the  array  sensor,  2C(/»l)  is 
the  discrete- time  Fourier  transform  of  the  same  signal  as  received  at  the  entire  array 
of  sensors,  and  the  sensor  is  the  reference  sensor  of  the  array.  Thus,  the  signal 
replica  vector  is  the  normalized  cross-correlation  between  the  discrete-time  Fourier 
transform  of  the  signal  of  interest  as  received  at  the  reference  sensor  and  the  same 
signal  as  received  at  the  entire  array  of  sensors.  It  is  important  to  note  that  the 
signal  replica  vector  is  defined  in  terms  of  the  propagating  signal  as  received  at  the 
array  of  sensors.  This  new  defintion  is  used  for  two  reasons.  First,  it  explicitly  allows 
the  parameterization  of  the  spatial  structure  of  a  signal  emitted  by  stochastic  source 
and  which  propagates  through  a  random  medium.  Second,  this  parameterization 
incorporates  all  of  the  the  information  concerning  spatial  structure  of  the  signal  of 
interest  which  can  be  exploited  by  a  linear  processor  which  is  optimized  to  minimize 
a  mean-squared  error  criterion. 

When  the  source  at  the  location  z  emits  the  complex  exponential  and  the 

medium  is  time-invariant  as  described  previously,  the  signal  replica  vector  as  defined 
in  (1.1)  is,  to  within  a  complex  scaling  factor,  identical  to  the  traditionally  defined 
replica  vector  also  described  previously. 

The  term  in  the  denominator  of  (1.1)  is  a  normalization  term  which  yields 
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=  1-  Another  normalization  convention  is  proposed  in  Subsection  3.1.3.  A 
different  definition  of  the  replica  vector  is  proposed  in  Subsection  3.2.3.  This  defini¬ 
tion  is  similar  in  concept  to  (1.1)  in  that  it  is  based  upon  the  spatial  cross-correlation 
of  the  signal  of  interest  and,  in  a  time-invariant  propagation  environment,  these  defi¬ 
nitions  are  roughly  equivalent.  However,  in  a  random  time- varying  medium,  they  are 
different,  and  the  definition  proposed  in  Subsection  3.2.3  yields  better  results. 


1.3  Array  Processor  Performance  in  Uncertain 
Propagation  Environments 

As  mentioned  earlier,  array  processors  exploit  the  fact  that  the  spatial  characteristics 
of  a  signal  as  received  at  an  array  of  sensors  depend  on  the  location  of  the  source 
of  the  signal  in  order  to  differentiate  among  signals  emitted  by  sources  at  different 
locations.  High  resolution  processors  are  able  to  discriminate  among  signals  whose 
spatial  characteristics,  parameterized  here  by  the  signal  replica  vector,  differ  only 
slightly.  While  the  ability  to  discriminate  among  signals  whose  replica  vectors  differ 
only  slightly  provides  good  spatial  resolution,  it  also  makes  the  processor  sensitive  to 
changes  in  the  propagation  characteristics  of  the  environment.  A  small  change  in  the 
characteristics  of  the  propagation  medium  resulting  in  a  small  change  in  the  signal 
replica  vector,  may  cause  the  processor  to  inaccurately  estimate  the  location  of  the 
source  of  the  signal. 

Adaptive  processors,  such  as  Capon's  MVDR  Processor  [9],  are  particularly  sen¬ 
sitive  to  inaccuarate  or  imprecise  knowledge  of  the  characteristics  of  the  propagation 
environment  (referred  to  as  model  mismatch).  This  sensitivity  stems  from  the  fact 
that,  if  the  processor  incorrectly  calculates  the  signal  replica  vector,  then  the  received 
signal  emitted  by  a  source  at  the  location  of  interest  will  not  be  recognized  as  such. 
Consequently,  the  processor  will  attempt  to  reject  (i.e.,  filter  out)  that  signal.  The 
sensitivity  of  various  adaptive  and  non-adaptive  processors  to  model  mismatch  has 
been  analyzed  extensively  [13,  14, 15]. 

Several  approaches  to  reducing  the  sensitivity  of  adaptive  processors  to  model 
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mismatch  have  been  proposed.  The  most  commonly  proposed  approach,  which  is 
applicable  to  linear  processors,  is  to  add  an  additional  constraint  to  the  array  weight 
optimization  problem  which  places  an  upper  bound  on  the  norm  of  the  array  weight 
vector.  A  survey  of  these  methods  is  contsuned  in  [8].  The  motivation  for  these 
approaches  is  that  the  sensitivity  of  a  processor  to  spatially  uncorrelated  perturbations 
to  the  nominal  spatial  characteristics  of  a  signal  is  proportional  to  the  norm-squared 
of  the  array  weight  vector.  A  related  approach,  referred  to  as  the  Generalized  Cross- 
Spectral  Method  [16],  is  to  add  a  penalty  function  proportional  to  the  norm-squju'ed 
of  the  array  weight  vector  to  the  criterion,  which  is  minimized  or  maximized  by  the 
selection  of  the  optimal  array  weight  vector. 

Another  approach  to  reducing  the  sensitivity  of  linear  processors  to  model  mis¬ 
match  is  to  constrain  the  response  of  the  linear  weight-and-sum  beamformer  over  a 
range  of  environmental  conditions.  One  example  is  the  Multiple  Constraints  Method 
[17],  which  accomplishes  this  goal  by  placing  equality  constraints  on  the  response  of 
the  beamformer  at  a  number  of  locations  surrounding  the  location  of  interest.  Relying 
on  the  fact  that  the  signal  replica  vector  is  a  smooth  function  of  both  the  source  loca¬ 
tion  and  the  environmental  conditions,  the  equality  constraints  at  different  locations 
surrounding  the  location  of  interest  also  constrain  the  response  of  the  beamformer  at 
the  location  of  interest  for  various  environmental  conditions  which  are  close  to  the 
nominal  environmental  condition.  A  more  direct  approach  using  the  multiple  con¬ 
straints  approach  [18]  places  equality  constradnts  on  the  response  of  the  beamformer 
for  a  number  of  environmental  conditions  which  result  from  small  perturbations  to  a 
nominal  environmental  condition.  A  related  approach  [19]  uses  inequality  constraints 
on  the  response  of  the  processor  to  insure  that  for  various  environmental  conditions, 
the  actual  response  is  within  some  tolerance  factor  of  the  desired  response. 

A  third  approach  to  reducing  the  sensitivity  of  array  processors  to  model  mis¬ 
match,  which  is  not  limited  in  applicability  to  linear  beamformers,  is  the  random 
environment  approach,  here,  the  environmental  parameters  are  considered  random 
parameters  with  known  probability  distributions.  The  array  weight  vector  or  the  esti¬ 
mate  of  the  signal  parameters  are  chosen  to  mininimize  or  maximize  a  criterion  which 
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is  averaged  over  the  possible  values  of  the  environmental  parameters.  A  Maximum 
A  Posteriori  source  location  estimator  utilizing  this  approach  is  proposed  in  [20]. 

There  are  several  drawbacks  to  these  approaches.  First,  the  approaches  using 
linear  equality  constraints  lose  one  degree  of  freedom  in  the  beamformer  for  each 
constraint.  This  level  of  reduction  in  degrees  of  freedom  may  not  be  necessauy  to 
accomplish  the  desired  goal.  Second,  the  selection  of  the  response  levels,  the  norm 
bounds,  and  the  tolerance  factors  is,  in  general,  am  ad  hoc  procedure  without  clearly 
defined  criteria.  Finally,  the  processors  developed  using  the  random  environment  ap- 
proatch  may  exhibit  poor  performamce  for  particular  sets  of  environmental  conditions 
even  though  their  average  performance  is  good. 

This  work  seeks  to  develop  am  array  processor  which  exhibits  the  efficient  use  of 
degrees  of  freedom  and  the  interference  rejection  capability  charawrteristic  of  adaptive 
array  processors,  and  the  source  localization  capability  characteristic  of  the  matched 
field  processors  while  operating  with  only  approximate  information  about  the  prop¬ 
agation  characteristics  of  the  medium.  The  minmaoc  signal  processing  approach  is 
proposed  to  develop  such  a  processor.  The  minmax  approach  requires  that  an  error 
criterion  which  is  a  function  of  the  environmental  conditions  as  well  as  the  processor 
characterists  be  defined.  Using  this  criterion  as  a  measure  of  processor  performance, 
the  maximum  value  of  the  criterion  taken  over  a  user-specified  ramge  of  environmental 
parameters  is  minimized.  If  this  is  done  in  an  adaptive  mamner,  the  processor  should 
be  able  to  efficiently  use  its  degrees  of  freedom  to  improve  the  performamce  of  the 
processor  for  the  environmental  conditions  where  the  performance  is  most  critical. 

The  use  of  the  minmax  approach  to  develop  a  processor  which  is  insensitive  to 
modeling  uncertainties  has  been  studied  previously  ([21]  and  references  therein).  How¬ 
ever,  the  signal  processing  techniques  developed  therein  are  not  applicable  to  the  prob¬ 
lem  of  achieving  spatial  discrimination  in  an  uncertain  propagation  environment,  and 
are  not  adaptive  in  the  sense  described  in  Subsetion  1.1.  Therefore,  the  Adaptive 
Minmaoc  Matched  Field  Processor  described  in  Chapter  2  is  proposed  to  achieve  the 
goal  of  this  work. 
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Chapter  2 


Minmax  Array  Processing 


For  the  reasons  stated  in  Chapter  1,  a  minmax  approach  is  used  here  to  develop  an 
adaptive  array  processor  which  is  robust  with  respect  to  uncertainties  in  the  prop¬ 
agation  environment.  Section  2.1  presents  a  general  minmax  framework  for  signal 
processing  along  with  a  characterization  theorem  for  the  solutions  to  a  large  class  of 
minmax  signal  processing  problems.  Using  this  theorem  as  a  basis,  an  algorithm  for 
adaptive  minmax  matched  field  processing  is  developed  in  Section  2.2.  Section  2.3 
addresses  the  implementation  of  the  Adaptive  Minmax  Matched  Field  Processor.  A 
new  algorithm  is  developed  to  solve  a  particular  class  of  quadratic  minmax  problems 
which  includes  the  minmax  portion  of  the  Adaptive  Minmax  Matched  Field  Proces¬ 
sor.  This  algorithm  has  the  desirable  property  of  being  guar^mteed  to  converge  to 
an  exact  solution  in  a  finite  number  of  iterations.  Finally,  a  new  approach  to  the 
development  of  minmax  estimation  error  bounds  is  proposed  in  Section  2.4. 


2.1  The  Minmax  Signal  Processing  Framework 

The  framework  in  which  the  array  proc^sing  algorithm  described  in  Section  2.2  is 
developed  is  minmax  signal  processing.  In  general  terms  the  framework  addresses 
the  problem  of  developing  a  processor  whose  worst-case  performance  evaluated  over 
a  given  class  and  range  of  uncertainties  is  as  favorable  as  possible.  Specifically,  let 
se)  a  processor  parameterized  by  the  vector  j£  which  operates  on  an  observed 
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Figure  2-1:  The  Minmax  Signal  Processor 

signal  j/  to  generate  an  estimate  of  some  signal  or  parameter  of  interest  £  (Figure  2-1). 
The  (•)  symbol  denotes  an  estimate  of  the  variable  over  which  it  is  positioned  (e.g.,  x 
denotes  an  estimate  of  the  vector  x.).  The  set  of  allowable  values  for  the  parameter 
vector  t£  is  denoted  by  W.  In  the  case  where  g(y^,  xv)  is  a  linear  filter  with  N  taps,  the 
vector  j£  could  contain  the  filter  weights  and  W  could  be  the  space  of  N-dimensional 
complex  numbers  C^. 

The  parameters  which  govern  the  relationship  between  the  observed  signed  £  and 
the  signal  or  parameter  of  interest  x  are  referred  to  as  the  environmental  parameters 
and  denoted  by  the  vector  <f>.  In  the  context  of  array  processing  problems  where  y  is 
the  received  signal  and  2.  is  a  particular  signal  of  interest,  the  vector  4  could  contain 
the  location  of  the  array  sensors  or  the  phase,  gain,  and  directional  characteristics  of 
those  sensors.  It  could  also  contain  a  parameterization  of  the  interfering  signals  or  the 
characteristics  of  the  propagation  medium.  The  ability  of  any  pwticular  processor 
as  determined  by  the  choice  of  jr  to  estimate  £  depends  upon  the  particular  envi¬ 
ronmental  condition  under  which  the  processor  operates.  Thus,  a  particular  value  of 
m  which  yields  good  processor  performance  under  one  environmental  condition  may 
yield  very  poor  performance  under  another  environmental  condition.  A  real  valued 
error  function  is  used  as  a  figure  of  merit  to  evaluate  the  performance  of  any 

particular  processor  operating  under  any  particular  environmental  condition. 

If  the  processor  has  perfect  knowledge  of  the  environmental  conditions  (e.g., 
(f>  =  ^),  then  the  processor  parameters  can  be  chosen  to  minimize  £(22,^).  However, 
in  many  situations  the  processor  does  not  have  perfect  knowledge  of  the  environmen¬ 
tal  conditions  under  which  it  must  operate,  but  instead  knows  only  that  the  uncertdn 
environmental  parameter  4  within  some  range  denoted  by  the  set  The  pro- 
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cessor  should  then  be  designed  to  operate  over  this  entire  range. 

As  discussed  in  Section  1.3,  one  possible  approach  to  designing  the  processor  to 
operate  over  $  is  to  treat  ^  as  a  random  parameter  with  an  assigned  pdf  (probability 
distribution  function)  and  then  select  to  minimize  the  average  value  of  e{w, 
taken  over  $  with  respect  to  p^.  That  is, 

=  argmm^  p^{^)  e(i£,£,)  d^. 

However,  this  approach  requires  that  a  pdf  be  explicitly  assigned  to  ^  and  does 
not  necessarily  solve  the  problem  of  the  processor  performance  being  very  poor  for 
particular  environmental  conditions  under  which  it  may  have  to  operate. 

The  minmax  signal  processing  framework  makes  it  possible  to  avoid  these  prob¬ 
lems  when  selecting  w  by  treating  ^  as  a  nonrandom  parameter.  Then,  under  the 
assumptions  that  is  a  continuous  function  of  ^  for  every  w£W  and  $  is  a 

compact  set  contained  in  a  metric  space,  the  worst-case  performance  of  the  processor 
over  the  range  of  the  environmental  parameters  is  defined  as 

A(ui)=max£(i£,<^). 

A(ui)  is  referred  to  as  the  extremal  value  for  the  processor  parameter  vector  w.  The 
optimal  minmax  processor  parameter  vector  is  defined  as  that  which  minimizes  this 
extremal  value.  Mathematically,  this  is  stated  as 

^ 

As  in  any  optimization  problem,  the  specification  of  the  necessary  conditions  which 
must  be  met  by  the  optimal  solution  and  the  sufficient  conditions  which  guarantee  that 
a  solution  is  optimal  are  of  central  importance.  The  specification  of  such  conditions 
for  minmax  optimization  problems  requires  the  definition  of  extremal  points,  extremal 
point  sets,  the  convex  hull  of  a  set  of  points,  and  the  gradient  operator.  An  extremal 
point  is  any  environmental  point  ^  contained  in  9  at  which  the  error  function  e(t£,^) 
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achieves  the  extremal  value  A(i£).  The  extremal  point  set,  denoted  by  M{w),  is  the 
set  of  all  extremal  points.  That  is, 

€  $  I  =  A(i£)}. 

Given  any  set  of  points  A  contained  in  a  metric  space  5,  the  convex  hull  of  the  set 
A  in  S,  denoted  by  Ti  (^4),  consists  of  all  points  i  €  5  which  can  be  expressed  as  the 
convex  combination  of  the  points  €  A.  That  is, 

H{A)  =  {seS\3J>0,  aieA,  and  p,  C  R,  for  t  = 

j  J 

s.t.  p,  >  0  t  =  1,...,  J  ,  =  ^  i=5Zp«S,}- 

i=l  i=l 

A  final  required  definition  is  that  of  the  gradient  operator.  Let  e{w,^  be  any 
real  valued  scalar  function  of  the  vectors  id  and  Then  the  gradient  operator  of  e 
with  respect  to  w  is  any  vector  function  of  and  denoted  by  which 

is  continuous  with  respect  to  jn  and  ^  and  for  which  the  following  is  true:  There 
exists  a  real,  positive  scalar  constant  k  such  that  for  any  particular  processor  pa¬ 
rameter  vector  {x£o)  wid  environmental  condition  (^),  the  incremental  change  in  e 
corresponding  to  an  incremental  change  in  m  away  from  denoted  by  6j£,  is  equal 
to  it  <  V^e  ,6w>,  where  <  ,  >  denotes  the  inner  product.  A  formal  state¬ 

ment  of  this  definition  is  contained  in  Appendix  A.  The  definition  of  the  gradient  as 
a  vector  of  partial  derivatives  is  not  used  because  the  error  function  used  later  in  this 
chapter  is  not  differentiable  with  respect  to  the  elements  of  the  complex  vector  w. 

Given  the  preceding  definitions,  the  following  Minmax  Characterization  Theorem, 
which  is  a  generalization  of  that  given  in  Chapter  6  of  [1]  for  the  case  of  minmax 
approximation  with  differentiable  functions,  states  the  conditions  which  characterize 
the  optimal  solution  to  a  general  class  of  minmax  problems. 

Theorem  1  Let  $  be  a  compact  set  contained  in  a  metric  space  denoted 
by  r,  be  an  open  set  of  a  Euclidian  metric  space  denoted  by  E,  e  : 

X  ^  »  R  be  a  continuous  function  on  both  W  and  $  for  which,  at  each 
a  directional  derivative  with  respect  to  jj>  can  be  defined  on  $, 
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and  Vju£  ^u;,  ^  be  the  gradient  of  e  with  respect  to  w.  Then  a  necessary 
condition  for  to  be  a  solution  to  the  following  minmax  problem 


=  arg  mm  max  e{m,  ^), 


is  that 

If,  in  addition,  c  is  a  convex  function  of  jn  and  VT  is  a  convex  set,  this 
condition  is  a  necessary  and  sufficient  condition  for  jHo  £  W  to  be  the 
solution  to  the  stated  minmax  problem. 

A  proof  of  this  theorem  is  contained  in  Appendix  A.  This  theorem  states  that  a 
necessary  (and  sufficient  if  £  is  convex  on  W  and  W  is  itself  convex)  condition  for  the 
optimality  of  is  that  the  origin,  denoted  by  fi,  is  contained  in  the  convex  hull  of  the 
set  of  gradients  of  £  with  respect  to  jn  evaluated  at  the  extremal  points  of  £(tSo,^)- 
The  following  example  may  be  useful  to  clarify  the  definitions  and  the  concepts 
introduced  thus  far  and  to  provide  an  intuitive  interpretation  of  the  Minmax  Char* 
acterization  Theorem. 

Example:  Assume  that  jii  is  a  two-dimensional  real  vector,  =  R*,  <f> 
is  a  real  scalar  variable,  $  is  the  closed  interval  between  zero  and  one  (i.e. 

^  =  [0i  l])i  ^nd  e{ui,4>)  is  a  real-valued  scalar  function  which  :s  convex 
with  respect  to  iv  for  all  ^  and  continuous  with  respect  to  <f>. 

For  some  j£i.  Figure  2-2a  shows  the  error  function  plotted  as  a  function 
of  4>  with  the  extremal  value  and  the  extremal  point  set  M{wi)  = 

<t>2i  ^3}  labeled.  The  gradients  of  e  with  respect  to  evaluated  at 
and  each  of  the  three  extremal  points  are  shown  in  Figure  2-2b.  The 
convex  hull  of  this  set  of  vectors  is  the  shaded  region.  The  origin  is  not  in 
this  convex  hull  and  therefore  jui  is  not  an  optimal  solution.  This  can  be 
seen  by  noting  that,  if  we  can  choose  a  direction  vector  such  as  i  for  which 
<  0  for  t  =  1,  2,  3,  then  the  initial  change  in  e  evaluated 
at  each  of  the  extremal  points  will  be  negative  as  we  move  away  from  j£] 
in  the  direction  of  i.  Since  the  value  of  £  is  simultaneously  reduced  at 
each  of  the  extremal  points  as  we  move  away  from  the  extremal  value 
of  £  will  be  reduced  as  we  move  away  from  and  therefore  nh  cannot  be 
an  optimal  solution. 

For  some  iii2,  Figure  2-3a  shows  the  error  function  plotted  as  a  function 
of  <t>  with  the  extremal  value  A(j^)  and  the  extremal  point  set  = 

^5,  labeled.  The  gradients  of  e  with  respect  to  jn  evaluated 
at  1^2  end  each  of  the  three  extremal  points  are  shown  in  Figure  2-3b 
with  the  convex  hull  denoted  as  before.  In  this  case,  the  origin  is  in  the 
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convex  hull  and  therefore  Wj  is  an  optimal  solution.  This  can  be  seen 
by  noting  that,  for  any  direction  vector  such  as  which  we  choose,  the 
inner  product  will  be  greater  than  zero  for  at  least  one 

of  the  three  gradient  vectors  (in  the  case  shown,  d* V^e  (wi.  6fi)  >  0). 
Therefore,  as  we  move  away  from  in  any  direction  the  initial  change 
in  £  evaluated  at  one  or  more  of  the  extremal  points  will  be  positive.  Since 
the  value  of  £  increases  at  one  or  more  of  the  extremal  points  as  we  move 
away  from  the  extremal  value  of  £  will  be  increased  as  we  move  away 
from  t£2.  Therefore,  is  a  locally  optimal  solution.  However,  since  £  is 
a  convex  function  of  w  for  all  A(in)  is  also  a  convex  funtion  of  w.  (See 
Lemma  A. 2  in  Appendix  A).  Therefore,  1^2  is  a  globally  optimal  solution. 


2.2  The  Adaptive  Minmax  Matched  Field  Processor 


2.2.1  Signal  Model 


The  Adaptive  Minmax  Matched  Field  Processor  takes  as  its  input  the  signal  received 
by  an  array  of  sensors  which  has  been  low-pass  filtered  to  prevent  frequency  domain 
aliasing  and  then  sampled.  This  input  is  denoted  by  the  vector  time  series  This 
input  signal  is  assumed  to  be  the  sum  of  propagating  background  noise  generated 
by  spatial  spread  sources,  such  as,  breaking  surface  waves,  sensor  noise  which  is 
assumed  to  be  spatially  white,  and  propagating  signals  generated  by  spatially-discrete 
point  sources  such  as  marine  mammals,  ships,  etc..  2,[m,  z]  denotes  the  time  sampled 
received  signal  which  was  emitted  by  a  point  source  at  the  spatial  location  z.  n[m] 
denotes  the  sum  of  the  sensor  noise  and  the  received  propagating  background  noise. 
It  is  assumed  that  n{m]  and  i[m,  z]  are  uncorrelated  zero  mean  wide-sense  stationary 
random  processes  for  each  i  and  that  £[m,£i]  and  2,[m,£2]  uncorrelated  for  any 
two  source  locations  Zj  ij.  Thus,  j/[m]  is  a  zero  mean  random  process  represented 
by 


=  dm] +  53  £["*.!]• 

i 

The  modeling  of  ^  ^  mean  random  process  can  include  a  signal 

emitted  by  a  stochastic  source  propagating  through  either  a  deterministic  or  a  random 
environment,  or  a  signal  emitted  by  a  deterministic  source  propagating  through  a 
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random  environment. 


2.2.2  Processor  Structure 

The  Adaptive  Minmax  Matched  Field  Processor  was  developed  to  achieve  the  effective 
sidelobe  control  which  is  characteristic  of  adaptive  processors  such  as  Capon’s  MVDR 
Processor  [9],  and  the  improvement  in  spatial  resolution  provided  by  matched  field 
processing  techniques  such  as  those  presented  in  [11],  without  exhibiting  the  extreme 
sensitivity  to  mismatch  in  the  estimation  of  the  characteristics  of  the  propagation 
environment  which  is  exhibited  by  these  algorithms  and  techniques  [13,  17).  The 
quantity  estimated  by  the  processor  is  the  average  power  in  a  selected  frequency 
component  of  the  signal  emitted  by  a  point  source  at  a  location  of  interest  as  received 
at  one  array  sensor  (the  reference  sensor).  The  location  of  interest  is  referred  to  as 
the  array  focal  point.  The  signal  emitted  by  a  point  source  at  the  array  focal  point 
and  received  at  the  reference  sensor  is  referred  to  as  the  desired  signal  and  denoted 
by  x*[m,i].  Here  the  sensor  is  the  reference  sensor  and  z  is  the  array  focal  point. 
Unlike  the  case  of  traditional  array  processors,  the  desired  signal  is  not  the  signal  as 
emitted  by  a  source  at  the  array  focal  point.  Instead,  the  desired  signal  is  the  signal 
emitted  by  a  source  at  the  array  focal  point  as  received  at  the  reference  sensor.  The 
array  focal  point  can  be  swept  through  space  and  the  selected  frequency  can  be  swept 
through  the  frequency  spectrum  to  generate  an  estimate  of  the  average  power  in  the 
desired  signal  as  a  function  of  spatial  location  and  temporal  frequency.  This  estimate 

A 

is  denoted  by 

Conceptually,  the  processor  which  generates  this  estimate  consists  of  three  mod¬ 
ules  (Figure  2-4).  The  first  module  divides  the  time-sampled  signal  received  by  the  ar¬ 
ray  j/[m]  into  segments  M  samples  in  length  which  may  be  overlapping,  and  computes 
the  vector  discrete-time  Fourier  transform  of  each  segment  at  the  selected  frequency 

maeO 

/  indicates  the  segment  number,  j{^*[m]  is  the  m‘*  sample  of  the  segment,  and  At 
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is  the  sampling  period.  Here,  /  is  the  frequency  expressed  in  cycles  per  second  which 
satisfies  |  /  ]<  /  is  not  the  normalized  frequency  expressed  in  cycles  per  S2unple 

which  satisfies  |  /  |<  5.  The  linearity  of  the  Fourier  transform  yields 

Y!(f)  =  '£2L\U)  +  li'(f), 

i 

where  the  summation  is  over  the  locations  of  the  point  sources.  The  transformed  seg¬ 
ments  axe  known  as  “snapshots”  and  denotes  the  snapshot  of  the  segment 

of  x[m,z].  Yi{f)  denotes  the  discrete-time  Fourier  transform  of  the  segment  of 
the  signal  received  by  the  ***  array  sensor.  In  effect,  the  first  module  is  a  temporal 
filter  which  selects  the  frequency  component  of  interest  in  the  received  signal. 

The  snapshots  of  the  received  signal  are  the  inputs  to  the  second  module  which  is 
a  linear  weight-and-sum  beamformer.  This  beamformer  computes  an  estimate  of  the 
Fourier  transform  of  the  /‘*  segment  of  the  desired  signal  using 

Xlif,!)  = 

where  tjn  is  the  array  weight  vector.  The  beamformer  is  a  spatial  filter  which  attempts 
to  pass  only  the  desired  signal  (i.e.,  that  which  was  emitted  by  a  source  at  the  array 
focal  point  z)  while  rejecting  all  other  signals  received  by  the  array  sensors.  The  final 
module  computes  an  estimate  of  the  average  power  in  the  desired  signal.  The  overbar 
indicates  the  sample  mean  taken  over  all  /.  That  is,  if  L  is  the  number  of  segments 
used  in  estimating  <r*(/,z),  then 

^  /=! 

While  this  structure  is  the  same  as  that  used  by  many  array  processors  such  as 
Capon’s  M  VDR  Processor,  the  unique  feature  of  this  processor  is  the  manner  in  which 
the  array  weight  vector  j£  is  calculated.  For  this  processor,  the  array  weight  vector  is 
the  solution  to  a  minmax  optimization  problem  where  the  error  £  is  a  measure  of  the 
spatial  filter’s  ability  to  pass  the  desired  signal  without  distortion  while  rejecting  the 


28 


Figure  2-4:  Array  Processor  Structure 


interfering  signals  in  a  given  propagation  environment. 

2.2.3  The  Minmax  Array  Weight  Problem 

For  any  particular  array  focal  point,  frequency,  array  weight  vector,  and  propagation 
environment,  the  error  function  for  the  Adaptive  Minmax  Matched  Field  Processor 
is  the  a  priori  mean-squared  error  in  the  estimation  of  XkiftS)  conditioned  on  the 
characteristics  of  the  propagation  environment.  That  is 

=  Ell  |»  I  £  (2.1) 

where  the  characteristics  of  the  propagation  environment  are  parameterized  by  the 
vector 

For  a  given  array  focal  point  and  frequency,  the  optimal  array  weights  are  defined 
as 

=  arg  min  max  £(/,£,!{;,  ^),  (2.2) 

s-eC" 

where  N  is  the  number  of  array  sensors  and  9  is  the  user  specified  range  of  the 
environmental  parameters  over  which  the  processor  must  oi>erate. 

Under  the  assumption  stated  earlier  that  the  desired  signal  and  the  interfering 
signals  are  uncorrelated,  (2.1)  can  be  rewritten  as 

( £  -  (2.3) 

2Real(Eti:(/,j)X:(/,i)  |  #  ffi)  +  in*  £[!:(/)»/)*  lilsE, 
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where  the  superscript  *  denotes  complex  conjugate. 

The  expectation  in  the  last  term  of  (2.3)  is  the  cross-spectral  correlation  matrix  of 
the  received  signal  conditioned  on  the  environmental  parameter  The  cross-spectral 
correlation  matrix  is  the  parameterization  used  by  the  processor  to  characterize  the 
spatial  structure  of  the  total  signal  field,  and  it  is  the  input  to  the  processor  which 
enables  the  processor  to  adapt  to  reject  unwanted  signals.  Here,  the  matrix  will  not  be 
treated  as  a  function  of  the  particular  environmental  conditions  or  the  characteristics 
of  any  particular  propagating  signal.  Instead  it  wil  be  treated  as  a  property  of  the 
total  signal  held.  Therefore,  the  conditioning  of  the  expectation  in  the  last  term 
of  (2.3)  is  dropped  and  the  actual  ensemble  cross-spectral  correlation  matrix,  S(/), 
is  used.  In  most  cases,  this  ensemble  cross-spectral  correlation  matrix  is  unknown 
to  the  processor.  Therefore,  the  sample  cross-spectral  correlation  matrix  given  by 
S(/)  =  2;  will  be  substituted  for  S(/).  Nothing  in  the  derivation  of 

the  algorithm  in  the  remainder  of  this  chapter  depends  upon  this  substitution. 

The  expectation  in  the  second  term  of  (2.3)  can  be  expressed  as 


EmU)x;{f,s)  \i] 

E[x,{f,s.)x;{f,z)  1^- 


The  quotient  is  the  signal  replica  vector  defined  in  Section  1.2  as 


,,  ..a  1^1 


Therefore,  the  second  term  can  be  expressed  as 


2  E[jrt(/,£)x;(/,i)  I  ^Real(£*(/,i,i)  m)  (2.4) 


The  signal  replica  vector  in  this  factorization  is  the  means  by  which  the  a  priori  model 
of  the  dependence  of  the  desired  signal’s  spatial  characteristics  on  the  environmental 
conditions  is  incorporated  into  the  processor. 

The  expression  E[X*(/,i)A']J(/,i)  |  ^  appears  in  the  first  term  of  (2.3)  and,  as 
a  result  of  the  factorization  in  (2.4),  will  also  appear  in  second  term  of  (2.3).  This 
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expression  is  the  conditional  average  power  in  the  desired  signal,  and  will  be  replaced 
by  the  actual  average  power  in  the  desired  signal  a^if,z)  =  E[Xk{f,s)X}^{f,z)]. 
Given  the  f2K;torization  and  the  substitutions  detailed  above,  the  error  criterion  can 
be  expressed  as 


«(/,!,  =  <?’(/, l)  -  2<r’(/,i)Real(£*(/,z,^)3ii)  +  S(/)  jn,  (2.5) 

where  the  dependence  of  the  error  on  the  average  power  in  the  desired  signal  is 
explicitly  shown. 

The  optimal  array  weights  minimize  the  maximum  value  of  this  error  taken  over 
the  operating  range  of  the  environmental  parameters.  Conceptually,  they  can  be 
considered  those  of  a  data-adaptive  Wiener  filter  which  is  robust  with  respect  to 
changes  in  the  spatial  correlation  of  the  signal  to  be  estimated. 

The  Adaptive  Minmax  Processor  described  in  this  subsection  can  also  be  inter¬ 
preted  as  an  efficient  implementation  of  a  bank  of  MVDR  Matched  Field  Processors, 
each  using  a  different  assumed  value  of  the  environmental  parameter  vector  {£}  and 
therefore  of  the  signal  replica  vector  (£(/,!, ^))  (Figure  2-5).  The  range  of  assumed 
values  of  ^  is  the  range  of  environmental  conditions  over  which  the  processor  is  de¬ 
signed  to  operate.  The  Adaptive  Minmax  Processor  output  is  the  output  of  the 
MVDR  Processor  with  the  largest  estimated  average  power.  The  derivation  of  this 
interpretation  is  detailed  in  Section  3.1  where  the  processor  bank  interpretation  is 
equivalent  to  the  Two-Stage  MVDR  Matched  Field  Processor  interpretation. 

2.2.4  The  Minmax  Array  Processing  Algorithm 

A  problem  in  calculating  the  solution  to  the  minmax  problem  in  (2.2)  is  that  the  error 
criterion  and  therefore  the  optimal  array  weights  are  functions  of  However, 

the  array  processor  does  not  have  knowledge  of  the  true  value  of  <t*(/,  i),  but  estimates 
it  to  be  the  sample  average  power  in  the  output  of  the  weight-and-sum  beamformer. 
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Figure  2-5:  The  MVDR  Processor  Bank 

That  is, 


»V<i)  =  i-yi(/.i)i=  =  =  tt‘s(/)E. 

^  <=i 

Therefore,  the  error  criterion  and  optimal  array  weights  depend  upon  the  average 
power  in  the  frequency  component  of  interest  in  the  desired  signal  amd  the  estimate 
of  this  average  power  depends  upon  the  array  weights  used  by  the  beamformer.  This 
interdependence  makes  it  necessary  to  jointly  calculate  the  optimal  array  weights  and 
estimate  the  average  power. 

This  joint  calculation  and  estimation  problem  is  addressed  by  requiring  that  the 
average  power  in  the  desired  signal  used  when  calculating  the  optimal  array  weights 
weights  be  equal  to  the  estimated  average  power  in  the  desired  signal  resulting  from 
the  use  of  those  weights.  The  joint  array  weights  calculation/power  estimation  prob¬ 
lem  can  be  posed  as  finding  so  that 

UicpiiA  l))  =  “g  min  m^  £(/,  i,  m,  i)),  (2.6) 
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and 


»V,i)  =  s(/)  !£.„(/, (2.r) 

where  the  dependence  of  the  optimal  array  weights  on  the  average  power  is  explicitly 
shown. 

A  trivial  solution  to  the  problem  expressed  in  (2.6)  and  (2.7)  is  ix>^//,g,0)  =  Q 
and  d*(/,z)  =  0.  The  existence  of  a  non-trivial  solution  and  an  algorithm  for  jointly 
finding  the  nonzero  JiJop<(/>i»^^(/>l))  ^^(/»i)  which  satisfy  (2.6)  and  (2.7)  is 

based  upon  the  following  theorem,  a  proof  of  which  is  given  in  Appendix  B. 


Theorem  2  Let  <tI  be  any  real  positive  number,  $  be  a  compact  set 
contained  in  a  metric  space,  be  a  continuous  function  on  and 


=  arg  min  max£(/,2,iii,^. 


Then  for  any  real  non-negative  the  solution  to  the  problem 

=  arg  min  max £(/,!, i£,^,  o’) 
skcC 


is  given  by 


Therefore,  given  yLopt{f,?y<^l)  for  any  real  positive  er*,  the  solution  to  (2.6)  can  be 
expressed  as 

liJop<(/,lio’(/,l))  =  (2.8) 

Subsituting  (2.8)  into  (2.7)  yields 


=  (»’(/, l)/o’J)’  S(/)  !£.,)(/,£, O). 


(2.9) 


Solving  (2.9)  for  d*(/,i)  yields 

»V,i)  =  K’)’  S(/)  !£«(/, (2-10) 


The  optimal  array  weights  which  are  consistent  with  this  average  power  estimate  can 
be  calculated  using  (2.8).  Therefore,  the  following  algorithm  can  be  used  to  solve  the 
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joint  optimal  array  weights  calculation  and  average  power  estimation  problem  defined 
by  (2.6)  and  (2.7). 


1.  Assign  any  real,  positive  value  to  cr^.  Calculate  SVapt{ftS.i<^l)  ^  given  by 

aUptif,  arg  min  max e{f,  iti,  <f>,  al). 

jteC" 

2.  a\f,z)  =  (criy  «,(/,!,  a,^)  S(/)  tn^,(/,2,cT|))-» 

3.  li2opt(/,l,^^(/,l))  =  U2op«(/,l,<yo) 

Step  1  can  be  implemented  using  any  complex  minmax  approximation  algorithm 
capable  of  handling  quadratic  forms.  The  development  of  an  efficient  algorithm  to 
solve  this  particular  minmax  problem  is  detailed  in  Section  2.3. 


2.3  Solution  of  the  Minmax  Problem 

Step  1  of  the  array  processing  algorithm  developed  in  Section  2.2  requires  the  solution 
of  a  quadratic  minmax  problem.  A  major  impediment  to  the  implementation  of  min¬ 
max  signal  processing  algorithms  has  been  their  relatively  high  computational  com¬ 
plexity.  The  minmax  signal  processing  solutions  which  have  gained  widespread  use 
are  those  for  which  either  analytic  solutions  are  available  (e.g.,  the  Dolph-Chebyshev 
window  [22])  or  those  for  which  computationally  efficient  algorithms  have  been  devel¬ 
oped  (e.g.,  real  linear- phase  minmax  filter  design  using  the  Parks- McClellan  algorithm 
[23]).  By  exploiting  the  special  structure  of  the  quadratic  minmax  problem  contained 
in  Step  1  of  the  array  processing  algorithm,  the  algorithm  developed  in  this  section 
to  solve  the  minmax  problem  is  relatively  efficient  computationally  and  is  guaranteed 
to  converge  in  a  finite  number  of  iterations. 
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2.3.1  Characterization  of  i4Jopt(/,i,<Tj) 

From  Step  1  of  the  algorithm  in  Section  2.2,  the  minmax  problem  which  must  be 
solved  is 

^o)  =  arg  min  max e(/, i, jn,  a’), 

where  £(/,l,3£,^,  <To)  is  the  conditional  mean-squared  estimation  error  and  can  be 
expressed  as 


=  -  2<r’Real(£*(/,i,^  iv)  +  w!'  S(/)  3^;.  (2.11) 

The  following  characterization  theorem  for  the  minmax  array  weight  problem  states 
the  necessary  and  sufficient  conditions  satisfied  by  A  proof  of  this 

theorem  is  contained  in  Appendix  B. 

Theorem  3  Let  $  be  a  compact  set  contained  in  a  metric  space  and 
£(/,!,  ^)  be  a  continuous  function  on  $.  Then  a  sufficient  condition  for 
to  be  a  solution  to  the  following  minmax  problem 

=  arg  min  max£(/,z,ui,(A,a*), 

♦€♦ 


is  that 

3  J  >  0  , 

(2.12) 

and 

Q  M{w^), 

(2.13) 

such  that 

a  €  «  ({(S(/)!ii,  -  cIiU.lS)  I  #€*(!£.))). 

(2.14) 

A  necessary  condition  for  uj<,  to  be  a  solution  to  the  following  minmax 
problem 

S^optC/,  i.,  =  "g  min  m^  e(/,  z,  w,  <r*), 

yi,€C" 

is  that 

3J€  2Ar-|.l}  (2.15) 

for  which  (2.13)  and  (2.14)  are  satisfied. 
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(2.14)  is  equivalent  to 


a = E  ft  (§(/)!&  -  <T2i(/,i,&)),  (2.16) 

•=l 

where  Pi  >  0  and  Pi  =  !•  Algebraic  manipulation  of  (2.16)  yields  the  following 
expression  for  Wg. 

•si 

Therefore,  the  following  corollary  to  Theorem  3  states  an  equivalent  set  of  necessary 
and  sufficient  conditions  satisfied  by  ii2opt(/,i,0’|). 

Corollary  1  Let  $  be  a  compact  set  contained  in  a  metric  space  and 
£(/i  ^  be  a  continuous  function  on  Then  a  sufficient  condition  for 

to  be  a  solution  to  the  following  minmax  problem 

1,(^1)  =  arg  min  maxe(/,r,i£,^,a^), 

is  that 

3J  >  0,  (2.17) 

3M(l!!.)  =  {^,,...,£,}CJl/(ifc),  (2.18) 

and 

J 

3pi,...,pj  €R,  >  0,  p,  =  1,  (2.19) 

tsl 

such  that  ^ 

=  <^oS(/)"‘  Sp<£(/»^>^)-  (2-20) 

«sl 

A  necessary  condition  for  to  be  a  solution  to  the  following  min¬ 

max  problem 

m^tU,  1,(^1)  =  arg  min  maxe(/,z,ui,<6,0. 

is  that 

3J€  {l,...,2Ar  +  l}  (2.21) 

for  which  (2.18)  through  (2.20)  are  satisfied. 

Therefore,  if  the  appropriate  set  of  extremal  points  and  convex  weights  can  be 
determined,  the  optimal  array  weight  vector  can  be  calculated  directly.  The  minmax 
problem  can  thus  be  reformulated  as  jointly  finding  the  J,  •  • . ,  Pi, . . .  ,pj,  and 
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UU  £  which  satisfy  (2.17)  through  (2.20).  The  key  to  finding  the  appropriate 
set  of  extremal  points,  convex  weights,  and  array  weight  vector  lies  in  reformulating 
the  minmax  estimation  problem  as  a  Wiener  filtering  problem  with  the  uncertain 
environmental  parsuneter  treated  as  a  random  parameter. 


2.3.2  The  Least  Favorable  PMF  Random  Parameter  Framework 

From  Section  2.1,  in  the  minmax  signal  processing  framework  the  uncertain  environ¬ 
mental  parameter  is  treated  as  a  nonrandom  parameter.  However,  an  efficient  method 
for  calculating  the  optimal  minmax  array  weights  can  be  developed  by  treating  the 
uncertain  environmental  ^^arzuneter  as  a  random  parameter  with  a  particular  proba¬ 
bility  function  and  then  solving  for  the  minimum  mean-squared  error  array  weights 
(i.e.,  Wiener  filter  weights).  As  a  computational  necessity  zmd  to  ensure  that  £(/,i,  <i>) 
is  a  continuous  function  on  the  range  of  the  environmental  parameter  will  be  sam¬ 
pled  (i.e.,  ^  =  {^j, .  • .  and  the  minmax  problem  will  be  solved  on  this  discrete 

set  of  environmental  conditions.  The  issues  associated  with  the  effect  of  this  sampling 
are  treated  in  Subsection  3.1.4.  Therefore,  the  probability  function  assigned  to  the 
environmental  parameters  will  take  the  form  of  a  pmf  (probability  mass  function) 
rather  than  the  form  of  a  pdf.  The  pmf  will  be  denoted  by  £  e  and  is  defined  by 

p,=Probability[^  =  ^.]. 


Since  £  is  a  pmf,  it  must  satisfy 


K 

Pi  >  0  and  5^  Pi  =  1  . 

•si 

These  are  the  same  conditions  which  must  be  satisfied  by  the  convex  weights  used  to 
calculate  the  points  in  the  convex  hull  of  a  set  of  points  and  therefore  by  the  weights 
which  are  used  to  calculate  the  optimal  minmax  weight  vector  in  (2.20).  This  fact 
will  be  used  to  relate  the  Wiener  filter  weight  vector  to  the  optimal  minmax  weight 
vector. 
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For  any  pmf  and  array  weight  vector,  the  mean-squared  estimation  error  is 

=  E[l  Kif^2.)  -  Xlif.z)  p]  =  53  Pi  (2.22) 

tsl 

Substituting  (2.5)  into  (2.22)  and  carrying  out  the  algebraic  manipulation  yields 

K 

£(/.!,  l£,E,<yo)  =  -  2<To  Real((53  P.  £(/,1,^))V)  +l£*S(/)ui.  (2.23) 

Define  the  Wiener  filter  weight  vector  to  be 

arg  min  £(/,!,  t£,£,(T^). 

Then,  unconstrained  complex  quadratic  minimization  methods  yield 

2£mm.e(/,l,^^£)  =  <^0  S(/)“^  Y.  Pi  lif  y  ^) ■  (2-24) 

•=l 

(2.20)  and  (2.24)  differ  only  in  the  respect  that  in  (2.20)  the  summation  is  over  J 
extremal  points  contained  in  M{iVg)  while  in  (2.24)  the  summation  is  over  all  envi¬ 
ronmental  conditions  contained  in  $.  Therefore,  if  a  pmf  £  can  be  found  such  that  pi 
is  greater  than  zero  only  if  ^  €  A/(u;,„„„j(/,2,<7^,£)),  then  the  summation  in  (2.24) 
will  effectively  be  over  only  the  extremal  points  contained  in  A/(j£„„„,e(/,l,<^o>£))- 
In  this  case,  the  sufficient  conditions  in  Corollary  1  will  be  satisfied  by  A”,  pi, . . .  ,p/,'. 

The  key  to  finding  the  correct  pmf  can  be  discerned  by  observing  the  behavior  of 
the  mean-squared  error  function  £(/,l,«imm»e(/>l7<^l>£)5£»<^2)  conditional 

mean-squared  error  function  £(/,!,  as  £  is  allowed  to  vary. 

£ifyS.,Ulmm,eifyly(^lyE)yEy<^l)y  will  be  abbreviated  as  e(/,2,  £>  ^0)5  1® 

the  minimum  mean-squared  estimation  error  achievable  by  any  array  weights  given 
the  pmf  £.  That  is, 

£{f,l,mnm,eyE^^l)  ^  "lin  £(/,  i,  U;,  £,  ff’)  . 
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£(/,  1,  Wjnmteif,  1,  e), <^l),  which  Will  be  abbreviated  as  £(/,  z,  tr^„„e(£)>  is 

the  conditional  mean-squared  estimation  error  achieved  by  the  Wiener  array  weights 
for  the  pmf  £  conditioned  on  the  environmental  variable 
Define  as 

arg  min  e{f,2,w,£,al)  =  S(/)"^  £(/,z,^.). 

Then,  expressed  as 

K 

t=l 

Therefore,  as  pm  is  increased  incrementally  and  p„  is  decreased  incrementally  for 
some  m  and  n  (£  is  a  pmf  and  therefore  must  be  decreased  for  some  n  when  pm  is 
increased  for  some  m),  should  become  more  like 

and  less  like  w^t{f,z,£^,<Tl).  Therefore,  e(/,z,ii)„„,.*(p),^,(7*)  should  decrease 
and  £(f,2<,Wmm$e{E)^i.n'^o)  should  increase.  Furthermore,  since 

K 

=  5Z  P.  £(/.  1,  I£mm.e(£)>  £•>  >  (2-25) 

<=1 

£(/,!,  t£mm«,£,<^o)  should  increase  if£(/,z, !£„„„„(£), ^^,(7^)  > 

With  this  intuition  in  mind,  consider  the  following  example. 

Example:  Assume  that  ^  is  a  discrete-valued  real  scaletr  variable  zmd 

that  for  some  pmf  £^,  f(/,2,uimm»e(£p)>^»^o)  i®  as  shown  in  Figure  2- 
6.  Then,  if  we  select  any  extremal  point  €  A/ (£<,))  and  any 
nonextremal  point  ^  A/(ui„,TO,e(£p))  ^ot  which  Po„  >  0,  we  can  increase 
Po„  and  decrease  po„  incrementally.  The  effect  ill  be  that  the  mean- 
squared  estimation  error  e( /,i,JSTOm*e»£,»^o)  an^-  the  conditional  mean- 
squared  estimation  error  conditioned  on  ^  Ji2mm»e(£o)»  ^n,<^o)) 

will  increase,  and  the  conditional  mean-squared  estimation  error  con¬ 
ditioned  on  <i>  =  4>n  (£(/ii»lSmm»e(£^)>^n»<^|))  will  decrease.  Further¬ 
more,  since  £(/,i,j£mmM(£<,)>^j<^o)  he  decreased  at  the  extremal 
point,  the  extremal  value  E^{mmm$t{Ej)  he  decreased.  Conceptu¬ 
ally,  this  process  can  be  repeated  until  Pn  =  0  for  all  nonextremal  points 
<t>n  ^  Mil&mm$eiEo^)-  When  this  is  the  case,  the  sufficient  conditions  in 
Corollary  1  will  be  satisfied  and  lSJn>m»e(£j,)  he  the  optimal  minmax 
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e(f.z,w^p^.<|),o|) 


A(w,n»j(Po))  -  “ 
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m 

Figure  2-6:  Conditional  Mezm-Squared  Estimation  Error 

array  weights.  Any  further  change  to  involving  an  extremal  and  a 
nonextremal  point  will  require  lowering  the  probability  corresponding  to 
the  extremal  point  and  raising  the  probability  corresponding  to  the  nonex¬ 
tremal  point.  This  would  result  in  lowering  the  mean-squared  estimation 
error  £(/,!,  desired  should  be  that  which 

maximizes 


The  intuitive  result  illustrated  in  the  preceding  example  is  formalized  in  the  fol¬ 
lowing  theorem,  a  proof  of  which  is  contjuned  in  Appendix  B. 

Theorem  4  Let  P  be  the  set  of  all  possible  pmfs  which  may  be  assigned 
to  $  and  define  the  least  favorable  pmf  as 

arg max  min  £(/,!,!£,£, (7*)  = 

(2.26) 

Then 

arg  ndn  maxe(/,z,u;,^,(T*)  =  • 

This  theorem  states  that  the  least  favorable  pmf  is  the  pmf  for  which  the  minimum 
mean-squared  estimation  error  is  maximized  and  the  Wiener  filter  weight  vector  for 
the  least  favorable  pmf  is  also  the  optimal  minmax  array  weight  vector. 
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2.3.3  Solving  for  the  Least  Favorable  PMF 

By  combining  (2.23)  and  (2.24)  and  carrying  out  algebraic  manipulation,  the  mini¬ 
mum  mean-squared  estimation  error  can  be  expressed  as 

K  K 

=  <^l  {l-trl  (J3  S(/)“*  (j;  Pi2(/,S,^)))  •  (2.27) 

tsl  tsl 

Defining  the  matrix  as  Q(/,i)  =  [£(/,!, ^i),...,£(/,i,^)l,  (2.27)  can  be 

rewritten  as 


£(/,!, «i,£,<yo)  =  -^o£‘W.i)'‘S(/)"*(?(/,^)£)  . 

Finding  the  £  to  maximize  this  quantity  is  equivalent  to  finding  the  £  to  minimize 
the  matrix  quadratic  product  in  the  second  term.  Therefore,  (2.26)  can  be  rewritten 
as 


Hi/  =  "S  max  imn  £(/,  i,  jr,  £,  <t’)  =  arg  mm  £*(?(/,  i)'*§(/)"*  (?(/,  i)£ .  (2.28) 

Since  Q{fiZ)^S(f)~^Q{f,z)  is  a  Hermitian  matrix  and  £  is  a  real  vector. 

Defining  the  real  matrix  T{f,z)  =  Real((5(/,z)^S(/)"*Q(/,z)),  explicitly  defining 
the  set  P,  and  letting  the  vector  e  :=  [1, . .  • ,  1]*,  the  optimization  problem  in  (2.28) 
can  be  expressed  as  the  following  real  constrained  quadratic  minimization  problem. 

=  “-gmm  (2.29) 

A  solution  to  (2.29)  is  guaranteed  to  exist  because  £*  T(/,£)  £  is  a  continuous 
function  of  £  and  the  set  P  is  a  compact  set.  Since  S(/)  is  a  positive  definite  Her¬ 
mitian  matrix,  is  a  positive  semi-definite  Hermitian  matrix. 
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Therefore,  is  a  positive  semi-definite  symmetric  matrix  and  £*  T{f,z)  £  is  a 

convex  function  of  £.  There  are  a  number  of  algorithms  available  for  solving  linearly 
constrained  convex  quadratic  minimization  problems  such  as  (2.29).  An  efficient  al¬ 
gorithm,  based  on  complementary  pivot  theory,  was  proposed  by  Lemke  in  [24]  and  is 
described  in  a  more  readable  form  in  Chapter  11  of  [25].  The  basic  intuition  behind 
the  use  of  the  complementary  pivot  theory  to  solve  a  quadratic  problem  is  that  the 
necessary  and  sufficient  conditions,  known  as  the  Kuhn-Tucker  conditions  for  £^  to 
be  a  solution  to  the  problem  in  (2.29),  are  largely  a  set  of  linear  equations.  The 
Kuhn-Tucker  conditions  can  be  written  as  [25] 


g*£^  =  1,  3tt  €  R,  and  Bn  >  0  s.t. 

(2.30) 

t;*£^  =  0  and 

(2.31) 

H  -  r(/,i)£^  +ti  c  =  0  . 

(2.32) 

Given  that  £^  >  0  and  u  >  0,  (2.31)  requires  that  if  po-  >  0  then  v,  =  0  and  if  Vi  > 
0  then  =  0.  This  condition  is  known  as  the  complementary  slackness  condition; 
Po,  and  Vi  are  known  complements  of  each  other  and  together  they  are  known  as 
a  complementary  p2ur.  The  primary  fact  is  that,  with  the  exception  of  (2.31),  the 
Kuhn-Tucker  conditions  are  a  set  of  linear  equations  (equalities  and  inequalities)  and 
any  solution  to  this  set  which  also  satisfies  the  complementary  slackness  condition 
will  be  a  solution  to  (2.29). 

If  3i  €  {1,...,A’}  s.t.  T{f,z)ii  <  V;  ^  t,  then  letting  p,  =  1,  pj  = 

0  Vj  ^  t,  Vi  =  0,  Vj  =  T{f,z)ji  -  T(f,s)ii  Vj  ^  i ,  and  u  =  r(/,z)„  will  satisfy  the 
Kuhn-Tucker  conditions.  If  such  a  solution  is  not  apparent,  the  complementary  pivot 
algorithm  can  be  used  to  find  a  solution. 

(2.30)  through  (2.32)  are  not  in  a  form  which  allows  direct  solution  using  the 
complementary  pivot  algorithm.  The  algorithm  presented  in  [24]  and  [25]  does  not 
allow  the  inclusion  of  the  unrestricted  variable  u  and  the  vector  £  which  are  associated 
with  the  linear  equality  constraint  that  £*£^  =  1.  However,  with  some  algebraic 
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manipulation,  the  Kuhn-Tucker  conditions  can  be  rewritten  as 


£  >  0,  and  3£  >  0  s.t. 

(2.33) 

£*£  =  0  and 

(2.34) 

V  -  M{f,z)^  =  r(/,£) , 

(2.35) 

where =  [ui,  Po,,  Poj, •  •  •  »Pojf  ]  and  =s  [p^,,  yj,  03, ...  ,v/f  ].  Using  thefollowing 
notation  for  T(f,z) 


TiLi)  = 


Tix{f,z)  r(/,z) 
i(/,i)  t{U) 


1 


r(f,  z)  is  given  by 

1 


and  M{f,z)  is  the  following  matrix 


0  -e* 

.£  M(/,2) 


and  M{f,  1)  =  f{f,  1)  +  Tii{f,  z)  c  e‘  —  (/,  z)  —  f(/,  z)  £*.  Any  £  and  v  which  satisfy 

(2.33)  through  (2.35)  will  yield  a  ji  and  £^  for  which  (2.30)  through  (2.32)  will  be 
satisfied  for  some  u;  and  any  £  and  for  which  a  u  exists,  satisfying  (2.30)  through 
(2.32),  will  yield  a  £  and  £  which  satisfy  (2.33)  through  (2.35) .  Therefore,  a  solution 
to  (2.30)  through  (2.32)  can  be  found  by  solving  (2.33)  through  (2.35);  and  (2.33) 
through  (2.35)  are  in  a  form  for  which  a  solution  can  be  found  directly  using  the 
complementary  pivot  algorithm. 

The  complementary  pivot  algorithm  used  to  find  a  solution  to  (2.33)  through  (2.35) 
is  referred  to  as  Scheme  I  in  [24].  The  algorithm  finds  a  solution  by  introducing  a 
slack  variable  Zo  and  a  vector  with  positive  entries  i  and  then  conceptually  solving 
the  problem 


minzo  s.t. 


(2.36) 
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(2.37) 

(2.38) 

(2.39) 


«o,  £,  £  >  0  , 

=  0 ,  and 

i  -  M{f,z)i-dzo  -  Ms). 

Since  a  solution  to  (2.33)  through  (2.35)  is  guaranteed  to  exist,  the  solution  to  (2.36) 
through  (2.39)  will  be  =  0  with  a  £  and  £  which  satisfy  (2.33)  through  (2.35). 

The  details  of  the  complementary  pivot  algorithm  can  be  examined  in  [24]  or 
[25].  In  concept,  the  complementary  pivot  algorithm  is  very  similar  to  the  simplex 
method  [25]  for  solving  linear  programming  problems.  The  algorithm  is  solving  for 
2K  + 1  real  nonnegative  variables  and  at  the  solution  to  (2.36)  through  (2.39),  z  =  0. 
(2.39)  is  a  set  of  K  linear  equations  and  is  satisfied  at  each  iteration  of  the  algorithm. 
By  construction,  at  each  iteration  of  the  algorithm,  at  least  A”  +  1  of  the  unknown 
variables  equal  zero.  Using  the  terminology  from  the  simplex  method,  the  term  “non- 
basic  variables”  will  refer  to  A  + 1  of  the  variables  which  equal  zero.  The  term  “basic 
variables”  will  refer  to  the  other  A  variables.  Collectively,  the  basic  variables  are 
referred  to  as  the  basis.  It  is  not  required  that  every  basic  variable  be  non-zero.  It  is 
only  required  that  every  non-basic  variable  equals  zero. 

At  each  iteration  of  the  algorithm,  a  new  basis  is  selected  in  a  manner  which 
guarantees  that  (2.37),  (2.38),  and  (2.37)  are  satisfied.  When  the  vauiable  z  leaves 
the  basis,  which  guarantees  that  z  =  0,  the  algorithm  terminates.  The  fundamental 
difference  between  the  simplex  method  and  the  complementary  pivot  algorithm  is  that 
in  the  complementary  pivot  algorithm,  only  one  variable  from  each  complementary 
pair  can  be  in  the  basis  at  any  given  time.  This  is  enforced  to  guarantee  that  (2.38) 
will  be  satisfied. 

The  critical  property  of  the  algorithm  for  the  purpose  this  work  is  its  convergence 
property.  The  essential  points  of  the  convergence  proof  for  the  complementary  pivot 
algorithm  [26]  are  that,  at  each  iteration  the  basis  changes,  no  basis  can  be  visited 
by  the  algorithm  more  than  once,  and  there  are  a  finite  number  of  possible  bases. 
These  three  points  lead  to  the  conclusion  that  the  algorithm  must  terminate  in  a 
finite  number  of  iterations. 


The  following  two  claims,  for  which  proofs  are  contained  in  Appendix  B,  are 
necessary  in  order  to  analyze  the  convergence  of  the  algorithm  in  a  more  rigorous 
manner. 

Claim  1  A/(/,r)  is  a  co-positive-plus  matrix. 

Claim  2  The  set  of  equations  (2.33)  through  (2.35)  are  consistent. 

Given  Claims  1  and  2,  Theorem  11.1.8  on  page  446  of  [25]  states  that,  if  each  almost 
complementary  basic  feasible  solution  to  (2.37)  through  (2.39)  is  nondegenerate,  then 
the  complementary  pivot  algorithm  will  terminate  in  a  finite  number  of  iterations 
with  a  solution  to  (2.33)  through  (2.35). 

The  outstanding  condition  on  which  this  finite  convergence  property  of  the  algo¬ 
rithm  depends  is  that  each  almost  complementary  basic  feasible  solution  to  (2.37) 
through  (2.39)  is  nondegenerate.  In  [26]  this  is  referred  to  as  the  system  being  non¬ 
degenerate.  Lemma  4  on  page  616  of  [26]  states  that  almost  every  vector  d.  will  yield 
a  nondegenerate  system.  Therefore,  in  practice,  system  degeneracy  is  usually  not  a 
problem.  However,  should  the  degeneracy  of  the  system  be  a  concern,  a  modification 
of  the  complementary  pivot  algorithm  can  be  used  (Section  7  of  [26]).  Given  Claims  1 
and  2,  Theorem  2  on  page  618  of  [26]  states  that  the  modified  algorithm  will  converge 
in  a  finite  number  of  iterations  with  a  solution  to  (2.33)  through  (2.35).  A  clear 
explanation  of  the  modified  algorithm  is  given  on  pages  80  and  81  of  [27].  The  form 
in  which  the  modified  algorithm  is  presented  in  [27]  is  slightly  different  from  the  form 
used  in  [26],  but  the  two  algorithms  are  identical. 

Conceptually,  the  modification  to  the  complementary  pivot  algorithm  is  that  the 
system  of  linear  equalities  (2.39)  is  modified  to 

i  -  M(/,i)£-izo  =  £(/,£)  + /£,  (2.40) 

where  7  is  the  identity  matrix,  £  s  [j,  . . . ,  £^]*,  and  £  is  an  arbitrarily  small  pos¬ 

itive  constant  which  remuns  unspecified  throughout  the  execution  of  the  algorithm. 
This  small  perturbation  to  (2.39)  can  be  shown  to  always  create  a  nondegenerate  sys¬ 
tem  (2.37),  (2.38),  and  (2.40).  In  practice,  the  modification  requires  simply  that  the 
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scalar  comparisons  of  the  updated  ratios  in  Step  1  of  the  ori^nal  complemen¬ 
tary  pivot  algorithm  (page  440  of  [25])  be  replaced  by  lexicographic  comparisons  of 
the  updated  vector  ratios  where  L{f,s)  li  denotes  the  row  of  the  updated 

righthand  side  matrix  [r(/,£)  /]. 

2.3.4  Least  Favorable  PMF  form  of  the  Array  Processing 
Algorithm 

Given  the  least  favorable  pmf,  the  optima!  minmax  array  weights  can  be  calculated  by 
applying  Theorem  4  and  (2,24).  The  power  estimate  can  be  calculated  using  (2.10). 
These  can  be  combined  to  yield  the  following  three-step  algorithm  for  implementing 
the  Adaptive  Minmax  Matched  Field  Processor  which  does  not  require  an  a  priori 
assumption  about  the  average  signal  power. 

1.  Use  the  (modified)  complementary  pivot  algorithm  to  calculate 

2^J  -  argmm  £* 


2.  aUi)  =  (4r(/,z)£,p-» 

3.  w^t{f,z,aV,S.))  =  S(/)-'  Qif.l)  2,J 

2.4  Minmax  Estimation  Error  Bounds 

In  Section  2.1,  a  general  framework  for  minmax  signal  processing  was  described  and 
a  theorem  characterizing  optimal  minmax  estimators  was  developed.  However,  this 
characterization  required  that  the  estimator  be  parameterized  by  a  vector  js.  ^<I  that 
the  problem  be  reduced  to  finding  the  optimal  parameter  vector  ]£.  This  parameteri¬ 
zation  requires  that  the  “optimal”  estimator  lie  within  a  particular  class  of  estimators 
(e.g.,  linear  estimators).  While  the  resulting  estimator  is  the  best  from  within  the  cho¬ 
sen  class  of  estimators,  it  is  usually  not  possible  to  make  a  definitive  statement  about 
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how  the  estimator’s  performance  compares  to  that  of  arbitrary  unspecified  classes  of 
estimators.  Therefore,  it  is  useful  to  develop  lower  bounds  on  the  performance  (as 
measured  by  the  error  functions  e  and  A)  of  any  estimator.  While  not  explicitely 
calculating  any  bounds,  this  section  proposes  an  approach  to  the  development  of  such 
performance  bounds  for  the  case  where  e  is  the  conditional  mean-squared  estimation 
error.  In  the  development  of  this  approach,  the  form  of  the  optimal  minmax  estimator 
and  an  achievable  lower  bound  are  derived. 

The  problem  for  which  the  bound  is  proposed  is  the  estimation  of  a  scalar  param¬ 
eter  0  6  R  based  upon  observations  x  £  X  where  X  is  the  observation  space.  Given 
any  estimator  ^  :  A*  — »  R,  the  mean-squared  estimation  error  of  6  is  assumed  to 
depend  on  some  environmental  parameters  ^  In  the  literature,  these  parameters 
are  also  referred  to  as  nuisance  parameters.  The  bound  developed  will  be  a  lower 
bound  on  the  extremal  value 

A(5)=  max  eig,<i>) 

for  the  case  where 

«(«.«>)  =  E|(«-,(i))’UJ,  (2.41) 

the  conditional  expectation  is  taken  over  all  0  and  x;  and  g  is  any  function  mapping 
X  into  R. 

The  approach  to  bounding  A(y)  is  developed  using  the  least  favorable  pmf  frame¬ 
work  where  the  environmental  parameter  ^  is  considered  to  be  a  random  parameter. 
This  is  the  same  framework  used  in  Section  2.3  to  develop  an  efficient  algorithm  for 
solving  the  minmax  problem.  As  in  this  earlier  development,  the  set  of  possible  values 
for  the  environmental  paramters  $  is  sampled  to  yield  a  finite  discrete  set  and  the 
vector  £  is  the  pmf  for  ^  on  this  set.  In  the  earlier  development,  the  sampling  was 
needed  for  computational  reasons  as  well  as  to  ensure  that  the  error  measure  and  its 
gradient  were  both  continuous  on  However,  here  the  sampling  is  needed  only  for 
computational  reasons. 

The  following  theorem,  which  is  needed  to  prove  Theorem  6,  provides  some  insight 
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into  the  proposed  approach  to  bounding  A(^).  A  proof  of  this  theorem  is  contained 
in  Appendix  A. 


Theorem  5  Let  e{g,  <t>)  be  any  performance  measure  for  the  estimator 
g  given  the  environmental  condition  Let  $  =  let 

£  €  P  =  {£  €  I  £  >  0  “id  £*  £  —  1}  “ly  assigned  to  <f>  on 
Let  e(£,^)  =  E^i  ^i)  “id  let  ^(£)  be  any  global  lower  bound  on 
e(£,^).  That  is, 

/9(£)  <  min  £(£,^). 

Then,  ^(£)  is  also  a  lower  bound  on  A{g).  That  is. 


Ao=  min  A(s)  >  ^(£). 

«:X-.R 

The  implication  of  Theorem  5  is  that,  if  e{g,  <f>)  is  the  conditional  mean-squared 
estimation  error  as  given  in  (2.41),  then  e(2yg)  is  the  mean-squared  estimation  error 
and  the  traditional  methods  of  bounding  e(£,5)  (e.g.,  the  Cramer- Rao  bound,  the 
Weiss- Weinstein  bound,  etc.)  can  be  applied  to  bounding  A(^).  Since  the  inequality 
^(£)  $  Ao  holds  for  any  pmf,  £  can  be  chosen  to  maocimize  the  bound  being  used 
0{£).  However,  this  may  not  be  useful  unless  the  derived  bounds  are  reasonably  tight. 

The  following  theorem,  a  proof  of  which  is  contained  in  Appendix  A,  specifies  the 
optimal  minmax  estimator  and  an  achievable  bound  on  A(^),  and  sheds  some  light 
on  the  tightness  which  can  be  expected  of  the  bounds  developed  using  the  approach 
in  the  preceding  paragraph. 

Theorem  6  Let  $  =  P  =  {£€  R^  I  £  ^  0  “id  £*£=  1}> 

and  :  A"  X  P  -►  R  be  given  by 


^o(x,£) 


(2.42) 


where  p*|^(x  |  <f>i)  is  the  conditional  pdf  or  pmf  of  the  observation  x  given 
that  the  environmental  parameter  ^  Let  £{g^<l>)  —  E[(^— i^(z))*  ]  4>]- 
Let  the  least  favorable  pmf  £^^  €  P  be  defined  as 


A 


Then 


yo(*,£,P  = 


and  the  quantity 

K 

^PiJi  £i9o(x,2ij)^<i>i) 
i=l 

is  an  achievable  lower  bound  on 

A{g)  = 

If  <f>  is  considered  a  random  parameter  with  the  pmf  £,  then  (2.42)  can  be  rewritten 
as 

9o(x,2)  =  I 

and  go{x,^  is  the  minimum  mean-squared  error  estimator  of  6  given  x.  £(£,p)  is  the 
mean-squared  estimation  error  achieved  by  g(x)  given  the  pmf  £;  and  £(p,go(x,p))  is 
the  minimum  achievable  mean-squared  estimation  error  given  the  pmf  £.  Therefore, 
is  the  pmf  for  which  the  minimum  achievable  meain-squared  estimation  error  is 
maximized,  hence  the  term  least  favorable  pmf. 

In  most  situations,  it  will  be  impractical  to  compute  the  least  favorable  pmf  £^^, 
the  optimal  minmax  estimator  go(x,2jf)i  or  the  associated  achievable  minmax  es¬ 
timation  error  bound  Ao  =  H^iPifi  E[(^  —  9oix,2j^)y  \  ^.]  .  However,  Ao  is  an 
achievable  minmax  bound  and  equals  the  achievable  mean-squared  estimation  error 
bound  given  the  least  favorable  pmf.  Therefore,  it  is  reasonable  to  expect  that  the 
bound  development  approach  outlined  earlier  (i.e.,  choosing  the  pmf  to  maximize 
a  mean-squ2U'ed  estimation  error  bound  such  as  the  global  Cramer-Rao  bound)  will 
yield  minmax  bounds  which  are  at  least  as  tight  as  the  mean-squared  estimation  error 
bound.  The  further  investigation  of  this  approach  is  left  for  future  work. 
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Chapter  3 


Analysis  and  Interpretation  of  the 
Adaptive  Minmax  Matched  Field 
Processor 


The  analysis  of  the  algorithm  developed  in  Chapter  2  is  complicated  by  the  lack  of 
illustrative  analytical  solutions  to  the  minmax  problem.  The  analysis  of  the  charac¬ 
teristics  of  the  Adaptive  Minmax  Matched  Field  Processor  presented  in  Section  3.1  is 
based  upon  the  interpretation  of  the  processor  as  a  Two-Stage  MVDR  Matched  Field 
Processor  and  is  qualitative  rather  than  quantitative  in  nature.  This  analysis  moti¬ 
vates  a  modification  to  the  definition  of  the  signal  replica  vector  which  is  presented 
in  Subsection  3.1.3.  A  quantitative  analysis  of  the  algorithm  based  upon  numeri¬ 
cal  simulations  is  presented  in  Section  3.2.  The  cases  which  are  considered  involve 
propagation  in  a  deterministic  ideal  waveguide,  a  deterministic  horizontally-stratified 
ocean,  and  a  random  ideal  waveguide.  The  results  for  random  ideal  waveguide  are 
presented  in  Subsection  3.2.3.  They  motivate  a  further  modification  of  the  definition 
of  the  signal  replica  vector  and  show  clearly  the  shortcomings  of  the  MVDR,  Bartlett, 
and  Adaptive  Minmax  Processors  when  signal  coherence  over  the  array  aperture  is 
reduced.  Finally,  Section  3.3  contains  a  brief  analysis  of  the  computational  complex¬ 
ity  of  the  modified  complementary  pivot  algorithm  used  to  solve  the  minmax  problem 
in  Step  1  of  the  Adaptive  Minmax  Matched  Field  Processor. 


Figure  3-1:  The  Two-Stage  MVDR  Matched  Field  Processor 


3.1  MVDR  Interpretation  of  the  Adaptive  Min- 
max  Matched  Field  Processor 

The  Adaptive  Minmax  Matched  Field  Processor  developed  in  Chapter  2  C2ui  be  in¬ 
terpreted  as  the  combination  of  an  algorithm  which  calculates  an  effective  replica 
vector,  which  will  be  denoted  by  and  a  MVDR  Matched  Field  Processor  [9,  11] 
which  uses  as  the  replica  vector  of  the  desired  signal  (Figure  3-1).  For  several 
reasons,  this  interpretation  is  useful.  First,  it  relates  the  minmax  array  processor  to 
an  array  processor  whose  properties  are  well-understood.  Second,  it  mtdces  possible 
a  qualitative  analysis  of  the  properties  of  the  minmax  array  processor.  Finally,  the 
interpretation  motivates  a  modification  of  the  replica  vector  normalization  convention 
used  in  the  minmax  array  processor,  which  improves  the  performance  of  the  proces¬ 
sor.  Subsection  3.1.1  details  the  new  interpretatic;.  of  the  minmax  array  processor. 
Subsection  3.1.2  uses  this  interpretation  to  analyze  some  properties  of  the  minmax 
array  processor  while  Subsection  3.1.3  motivates  a  modification  to  the  minmax  array 
processor  as  developed  in  Chapter  2  and  details  the  modified  algorithm.  Finally,  Sub¬ 
section  3.1.4  uses  this  interpretation  to  formulate  guidelines  for  the  required  samipling 
of  $  in  creating  the  discrete  set  of  replica  vectors  over  which  the  minmax  optimiza¬ 
tion  is  carried,  and  discusses  some  factors  which  effect  the  selection  of  the  uncertainty 
range  of  environmental  parameters  over  which  the  processor  must  operate. 

The  structure  of  the  MVDR  Processor  [9]  is  identical  to  the  structure  the  Adaptive 
Minmax  Processor  as  shown  in  Figure  2-4.  The  array  weights  of  the  matched  field 
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implementation  of  the  MVDR  Processor  [11]  in  a  deterministic  medium  are  given  by 


w^t  =  arg  Tmnw^S{f)w 
hlgC” 

such  that  £*(/,z,^)tn  =  1, 


(3.1) 


where  the  replica  vector  is  defined  in  the  traditional  manner  as  described  in  Section  1.2 
and  ^  is  the  p2irameterization  of  the  assumed  environmental  conditions.  The  solution 
to  (3.1)  is  given  by 


kf)~'  £(/,!>  i,) 


(3.2) 


and  the  resulting  estimate  of  the  average  power  in  the  signal  emitted  by  the  source 
at  the  array  focal  point  is 


»'(/,!)  =  (s‘(/,i,l)  S(/)-'  £(/,!,£))-■.  (3.3) 


The  relationship  between  the  Adaptive  Minmtix  Matched  Field  Processor  and  the 
MVDR  Matched  Field  Processor  which  is  developed  in  this  section  is  built  upon  the 
similarity  between  the  form  of  the  solutions  in  (3.2)  and  (3.3)  and  the  solutions  for 
the  weights  and  estimated  average  power  in  Steps  2  and  3  of  the  algorithm  detailed 
in  Subsection  3.1.1  which  implements  the  Adaptive  Minmax  Processor.  In  compeiring 
these  solutions,  it  is  understood  that  two  processors  use  different  definitions  of  the 
signal  replica  vector,  and  that  under  some  signal  and  environmental  conditions  and 
vector  normalization  conventions,  the  two  definitions  are  equivalent. 


3.1.1  The  Two-Stage  MVDR  Matched  Field  Processor 

The  interpretation  of  the  Adaptive  Minmax  Matched  Field  Processor  as  the  Two- 
Stage  MVDR  Matched  Field  Processor  shown  in  Figure  3-1  is  motivated  by  noting 
that  the  algorithm  detailed  in  Subsection  2.3.4  can  be  rewritten  as 
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1.  Use  the  (modified)  complementary  pivot  algorithm  to  calculate 


2.  =  (4  (?'*(/, l)S(/)-^g(/,l)  2is)~" 

3.  =  ar'^U.z)  S(/)-^  Q(/,l)  £,y 

The  set  {(?(/,  a)£  |  £  >  fiand  e‘£  =  1}  is  the  convex  hull  of  the  set  of  column 
vectors  in  Q(f,  g)  (i.e.  {£(/,!, ^j), . . .  Defining  Q(f,z}  to  be  this  set  of 

replica  vectors  (column  vectors  of  Q(f,z)),  the  following  algorithm  is  equivalent  to 
the  Adaptive  Minmax  Matched  Field  Processor. 

1.  Use  the  (modified)  complementary  pivot  algorithm  to  calculate 

X  S(/)’^9 . 
i€W(g(/,i))- 


2.  ^^(/,l)  =  S(/)-^£^^p-^ 


3.  u!op«(/il><^*(/,l))  = 


Steps  2  and  3  of  this  algorithm  are  the  MVDR  Matched  Field  Processor  given  the 
replica  vector 


3.1.2  Analysis  of  the  Two  Stage  MVDR  Matched  Field  Pro¬ 
cessor 

From  Steps  1  and  2  of  the  Two-Stage  MVDR  Matched  Field  Processor,  £^^^  is  the 
vector  contained  in  H  which  maximizes  the  power  passed  through  the  re¬ 

sulting  MVDR  Matched  Field  Processor.  A  principal  characteristic  of  the  processor 
is  brought  to  light  by  this  interpretation. 

The  processor  treats  any  energy  in  any  of  the  rank  one  subspaces  spanned  by  the 
replica  vectors  in  H  (Q(/,i))  as  energy  of  the  desired  signal  and  attempts  to  pass 
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as  much  of  that  energy  as  possible  through  the  MVDR  Matched  Field  Processor. 
The  processor  does  this  by  first  hypothesizing  a  rank  one  signal  subspace.  It  chooses 
the  basis  for  this  subspace  from  within  7i  ((?(/,  l))  to  maximize  the  energy  in 
this  subspace  as  measured  by  the  MVDR  Matched  Field  Processor.  This  hypothesis 
of  a  rank  one  signal  subspace  spanned  by  some  replica  vector  in  H  ((?(/,  i))  causes 
performance  problems  in  a  random  propagation  medium  as  demonstrated  by  the 
results  in  Subsection3.2.3). 

This  characteristic  of  the  processor  is  seen  explicitly  when  Step  1  of  the  algorithm 
is  written  as 


N 


^}J  =  V  S  M  IJ.- £  ?, 

9€W  {Q(f,z))  *=1 


where  (A,,  u,)  are  the  i*'*  eigenvalue  and  eigenvector,  respectively,  of  S(/).  Assume 
that  all  vectors  in  H  (Q(/,i))  have  approximately  the  same  norm  (the  effect  of  vec¬ 
tors  with  widely  different  norms  is  analyzed  in  Subsection  3.1.3).  Since  S(/)"'  is 
Hermitian,  its  eigenvectors  will  comprise  an  orthonormal  set  which  spans  C^.  There- 
fore,  Xrjlj  I  P  =  1  £  p.  Since  all  £  €  H  (Q(/,i))  have  approximately  the  same 
norm,  I  Ji?£  P  can  be  minimized  only  by  adjusting  the  relative  magnitudes 

of  the  projections  of  q  on  each  of  the  eigenvectors.  Within  the  constraints  imposed 
by  X  the  solution  will  have  a  minimal  projection  on  the  eigenvectors 

whose  corresponding  eigenvalues  are  small  (i.e..  A,"*  is  large)  and  a  larger  projection 
on  the  eigenvectors  whose  corresponding  eigenvadues  are  large  (i.e.,  Af^  is  small). 
Conceptually,  this  corresponds  to  the  processor  maximizing  the  projection  of  g^^^  on 
the  portion  of  the  subspace  of  the  received  signal  (desired  signal  plus  noise)  with  the 
largest  average  power. 

The  characteristics  of  the  processor  and  the  tradeoffs  which  it  makes  in  calculating 
g^jj  can  be  seen  more  clearly  by  considering  the  following  special  case.  Assume 
that  the  desired  signal  is  a  deterministic  narrowband  signal  and  that  the  medium  is 
deterministic.  Then  the  subspace  of  the  desired  signal  will  have  rank  one  and  the 
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cross-spectral  correlation  matrix  of  the  received  signal  will  be 


S(f)  =  <rlSM)  + 


Here,  Sn{f)  is  the  norm2Jized  cross-spectral  correlation  matrix  of  the  noise,  is 
the  actual  replica  vector  of  the  desired  signal,  and  and  aj  are  the  average  power 
of  the  noise  and  the  desired  signal,  respectively.  For  this  example,  it  is  assumed  that 
S(/)  =  S(/). 

Several  concepts  and  quantities  are  useful  in  analyzing  the  processor  for  this  special 
case  [13].  The  first  of  these  is  the  array  gain  denoted  by  G.  Considering  the  array 
processor  to  be  a  spatial  filter,  the  array  gain  is  a  function  of  the  filter  weights.  It  is 
defined  as  the  ratio  of  the  signal  to  noise  ratio  at  the  output  of  the  filter  to  the  signal 
to  noise  ratio  at  the  input  of  the  filter  where  the  signal  to  noise  ratio  is  defined  as 
S/N  =  For  the  special  case  under  consideration,  the  array  gain  is  given  by 


=  I  I" 

!£*Sn(/)ie’ 

In  Section  3.2,  the  rate  of  decay  of  the  array  gdn  as  a  function  of  changes  in 
2^^  will  be  used  as  a  measure  of  the  robustness  of  a  processor.  It  can  be  shown 
that  the  linear  filter  weights  which  m2Lximize  the  array  gain  are  given  by  — 

^n{f)~^2act’  Substituting  these  weights  into  (3.4)  yields  Gmax  —  2^^  S„(/)  ^  2^^ 
and  the  maximum  achievable  signal  to  noise  ratio  at  the  output  of  a  linear  filter  as 
{S/NU.  =  ^  S„(/)-^  2^^. 

A  useful  measure  of  the  similarity  of  two  vectors  is  based  on  the  generalized  angle 
between  the  vectors.  For  any  positive-definite  Hermitian  matrix  C,  the  inner  product 
between  the  vectors  s.,b^C^  can  be  defined  as  a^Cb.  Given  this  inner  product,  the 
cosine-squared  of  the  generalized  angle  between  a  and  6  is  defined  as 


cos^(a,  6;  C)  = - - . 


The  cosine-squared  will  always  be  greater  than  or  equal  to  zero  and  less  than  or  equal 
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to  one.  It  will  equal  one  if  6  is  a  scalar  multiple  of  a  and  will  equal  zero  if  a  and  b 
are  orthogonal  with  respect  to  the  inner  product  defined  earlier.  Treating  the  inner 
product  and  the  cosine-squared  as  measures  of  the  similarity  of  a  and  b,  the  inclusion 
of  the  matrix  C  places  greatest  weight  on  the  similarity  or  dissimilarity  of  a  and  6 
in  the  subspace  spanned  by  the  eigenvectors  of  C  with  the  largest  eigenvalues,  and 
places  little  emphasis  on  the  similarity  or  dissimilarity  in  the  subspace  spanned  by 
the  eivenvectors  with  small  eigenvalues.  These  measures  and  concepts  provide  the 
tools  needed  to  analyze  some  aspects  of  the  performance  of  the  processor. 

Using  the  identity 

(A  +  6  =  A-'  -  (1  +  b’^A-H)-\ 


the  adaptive  replica  vector  calculation  problem  can  be  expressed  as 


i€W  ((?(/,!)) 


((S/N)„,.  + 1 
V  (S/N)„„ 


(3.5) 

From  this  expression,  it  can  be  seen  that,  when  the  maximum  achievable  signal  to 
noise  ratio  is  low,  the  processor  will  use  most  of  the  available  degrees  of  freedom  to 
minimize  (i.e.,  select  to  lie  in  the  noisy  subspace)  and  will  place 

little  emphasis  on  maximizing  cos^(£^^,g^^^;S„(/)”^)  (i.e.,  select  to  match 
as  closely  as  possible  in  the  relatively  noise-free  subspace.  A  low  maximum  achievable 
signal  to  noise  ratio  could  be  the  result  of  either  a  low  input  signal  to  noise  ratio  or 
a  low  Gmax  (i-e-,  the  actual  replica  rector  lying  in  a  noise  subspace  with  a  large 
average  power).  If  the  latter  is  the  cause,  then  the  expected  difference  between 
and  which  results  from  g^j^  lying  in  the  noisy  subspace  will  be  small  because 
g^^  lies  in  the  noisy  subspace  and  the  processor  may  be  able  to  choose  g^^j  to  lie 
in  the  noisy  subspace  and  be  close  to  However,  if  the  former  is  the  cause  and 
does  not  lie  in  the  noisy  subspace,  the  expected  difference  between  g^jj  and  g^^^ 
will  be  larger.  This  increased  mismatch  is  the  price  which  is  paid  for  a  significant 
amount  of  noise  power  existing  in  a  rank  one  subspace  spanned  by  a  replica  vector 
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in  W  (Q(/,z)).  However,  the  results  developed  by  Cox  [13]  indicate  that  for  cases 
such  as  this,  where  the  input  signal  to  noise  ratio  is  low,  the  array  gain  will  not 
decrease  signiiic2Lntly  as  the  mismatch  between  the  actual  and  effective  replica  vectors 
increases.  The  large  amount  of  noise  power  which  will  be  passed  through  the  filter 
in  this  case  is  consistent  with  the  processor  selecting  to  pass  as  much  power  as 
possible  through  the  resulting  MVDR  linear  filter. 

If  the  maximum  achievable  signal  to  noise  ratio  is  high,  the  processor  will  place  an 
increased  emphasis  using  the  available  degrees  of  freedom  to  maximize  cos*(2^^,  2^^^;  Sn(/)-^). 
Thus,  as  the  maximum  achievable  signal  to  noise  ratio  increases,  the  processor  will 
reduce  the  mismatch  between  and  The  tendency  of  the  processor  to  place  an 
increased  emphasis  on  increasing  cos^(2^^,2^^^;  Sn(/)“^)  as  the  maximum  achievable 
signal  to  noise  ratio  is  increased  is  verified  numerically  in  Section  3.2. 

The  effect  of  the  adaptive  replica  vector  calculator  on  the  array  gain  of  the  result¬ 
ing  processor  can  be  seen  by  rewriting  (3.4)  as  [13] 

g.  V  _  _ Gmax  _ 

'S.//'  -  1  +  +  ^G„„)(l  -  cos2(s.„,s.„;S„(/)-'))' 

Here,  the  array  gain  is  expressed  as  a  function  of  the  because  the  array  weights 
are  a  function  of  g^^^.  Consider  the  effect  on  the  array  gain  if  is  increased  , 
does  not  change,  and  cos*(g^^^,g^^^;S„(/)“')  /  1.0.  In  this  case,  ^'(2*//) 
decrease.  However,  from  the  preceding  analysis,  the  processor  adjusts  to  an  increase 
in  (and  the  corresponding  increase  in  (5/iV),no*)  by  adjusting  9  to  increase 
cos^(go^,g^^^;  Sn(/)~^).  This  adjustment  reduces  the  mismatch  between  g^^^  and 
and  therefore  reduces  the  loss  in  array  g«un. 

The  preceding  example  provides  a  qualitative  analysis  of  the  Adaptive  Minmax 
Matched  Field  Processor  using  the  interpretation  of  the  processor  as  a  Two-Stage 
MVDR  Matched  Field  Processor.  The  assumption  throughout  this  analysis  has  been 
that  all  of  the  replica  vectors  contdned  in  H  ((?(/,  i))  have  approximately  the  same 
norm.  Should  this  assumption  be  violated,  the  characteristics  of  the  processor  may 
differ  greatly  from  what  would  be  expected  based  upon  this  analysis.  The  effect  of 
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replica  vectors  with  widely  different  norms  on  the  performance  of  the  processor  and  a 
modification  to  the  algorithm  which  mitigates  this  effect  are  covered  in  the  following 
subsection. 


3.1.3  Replica  Norm  Considerations  and  a  Normalization 
Modification 


In  defining  the  signal  replica  vector  in  Section  1.2,  the  normalization  convention  was 
that  the  term  equals  one  where  k  is  the  number  of  the  reference  sensor.  This  con¬ 
vention  was  adopted  to  facilitate  the  factorization  of  the  cross-correlation  function 
E[2C(/»l)-^*(/,l)  I  into  where  a^(/,  r)  is  the  average  power  in  the 

desired  signal  as  received  at  the  reference  sensor.  When  using  large  aperture  arrays 
in  the  ocean  environment,  it  is  possible  for  different  environmental  conditions  to  yield 
different  ratios  of  the  average  power  in  the  desired  signal  at  the  reference  sensor  to 
the  norm  of  the  cross-correlation  vector  I  For  example,  if  the 

receiving  array  is  a  large  vertical  array  and  a  deterministic  Normal  Mode  propaga¬ 
tion  model  is  used  (see  Section  4.3  for  a  discussion  of  the  normal  mode  propagation 
model.),  then  different  environmental  conditions  will  result  in  different  mode  shapes 
and  there  will  be  different  distributions  of  signal  energy  as  a  function  of  depth.  This 
will  result  in  the  ratio  of  the  average  power  at  the  reference  sensor  to  the  norm  of  the 
cross-correlation  vector  varying  with  different  environmental  conditions.  Therefore, 
the  normalized  cross-correlation  vectors  (replica  vectors)  for  different  environmental 
conditions  may  have  different  norms. 

To  understand  the  effect  that  replica  vectors  with  different  norms  may  have  on 
the  adaptive  replica  vector  calculation  problem,  note  that  (3.5)  can  be  expressed  as 


2e//  = 


mm 


la 


a'^  Sn(/)-»  £ 


({S/NUa. 
\  {S/N)r 


+  1 


-  COS*(aact’a;Sn(/) 


(3.6) 

The  factor  in  the  brackets  [  ]  is  independent  of  the  norm  of  £,  depending  upon 

only  the  direction  of  £.  The  processor  will  jointly  adjust  the  norm  and  the  direction 
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Fig  3-2:  The  Convex  Hull  of  Replicas  with  Different  Norms 

of  £  within  the  constraints  imposed  by  Ti  (Q{f,z)^  to  balance  the  minimization  of 
the  norm-squared  of  £  and  the  factor  in  the  brackets,  respectively.  Consider  the 
case  shown  in  Figure  3-2,  where  £  €  and  the  norm  of  one  replica  vector  in  Q 
is  significantly  less  than  the  norm  of  the  other  replica  vectors  in  Q.  In  this  case, 
the  adaptive  replica  vector  calculator  will  tend  to  select  £^^^  to  lie  in  the  vicinity  of 
£(^2),  regardless  of  the  direction  of  £^^.  This  will  greatly  limit  the  adjustment  of 
the  direction  of  £^^^  reducing  the  ability  of  the  processor  to  adapt  to  different  actual 
replica  vectors,  and  will  cause  mismatch  between  £^^^  and 

The  array  gain  (3.4)  is  a  function  of  the  direction  of  the  array  weight  vector  rather 
than  its  norm.  The  direction  of  the  array  weight  vector  (Step  3  of  the  algorithm  in 
Subsection  3.1.1)  is  a  function  of  the  direction  of  the  £^^^  but  not  its  norm.  Therefore, 
the  array  gain  is  a  function  of  the  direction  of  the  £^^^  rather  than  its  norm.  Therefore, 
it  is  desirable  for  the  processor  to  adjust  the  direction  of  £^^^  without  regard  to 
the  resulting  norm  of  The  ability  to  adjust  is  improved  significantly  by 
modifying  the  definition  of  the  replica  vector  so  that  the  replica  vectors  have  a  norm 
of  one.  That  is,  define  the  replica  vector  as 


£(/,i,^)^e(2C(/,i)x;(/,i)  i^/|e[2C(/,^)x;(/,i)  i^|. 
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As  a  result,  no  longer  has  the  strict  interpretation  as  the  average  power  in  the 

desired  signal  as  received  at  the  reference  sensor.  However,  in  numerical  simulations, 
this  modification  significantly  increased  the  ability  of  the  adaptive  replica  vector 
calculator  to  adjust  to  track  as  the  latter  changed.  There  was  a  smaller 
mismatch  between  and  and  an  improved  performance  of  the  processor. 

A  second,  related  modification,  which  also  improved  the  performance  of  the  pro¬ 
cessor  is  motivated  by  considering  Figure  3-3,  which  shows  the  convex  hull  of  four 
replica  vectors,  all  of  which  have  been  normalized  to  have  a  norm  of  one.  Clearly, 
even  though  the  vectors  which  define  the  convex  hull  (i.e.,  the  vectors  in  (?(/,i)) 
have  a  norm  of  one,  the  convex  hull  contains  vectors  which  have  norms  of  less  than 
one.  However,  from  Subsection  3.1.2,  the  function  of  q^^j  is  to  define  a  rank  one 
signal  subspace  in  which  the  processor  estimates  the  average  power  using  the  MVDR 
Matched  Field  Processor.  Since  the  subspace  spannedby  is  independent  of  the 
norm  of  the  resulting  estimated  average  power  should  be  independent  of  the 

norm  of  This  desired  property  is  accomplished  by  normalizing  q^^^  to  have  a 
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norm  of  one.  This  normalization  is  implemented  by  modifying  Steps  2  and  3  of  the 
algorithm.  The  resulting  algorithm  is 


1.  Use  the  (modified)  complementary  pivot  algorithm  to  calculate 

=  arg  mm  £*  Q^(/,l)S(/)"'(2(/,z)  £ , 

where  all  the  columns  of  Q{f,z)  are  normalized  to  have  a  norm  of  one. 

2.  = 

3. 

As  with  the  two  modifications  detailed  previously  in  this  subsection,  <5^(/,l)S(/)“^(5(/,  i) 
can  be  replaced  by  Real((5*(/,l)S(/)“*(^(/,2))  in  Steps  1  and  2  of  the  algorithm. 

For  the  same  reason,  Q^(f,z)  Q(f,z)  can  be  replaced  by  Real((5*(/,l)  <5(/»l))  in 
Steps  2  and  3. 

The  preceding  two  modifications  do  not  entirely  eliminate  the  no.m  of  as  a 
factor  in  selecting  the  direction  of  the  effective  replica  vector  (i.e.  Referring  to 

Figure  3-3,  it  is  clear  that  even  though  the  replica  vectors  in  Q{f,  z)  have  unit  norm, 
vectors  with  norms  less  than  one  will  be  contained  in  H  ((?(/,  i)).  Therefore,  while 
the  effect  of  vector  norms  is  greatly  reduced  by  normalizing  the  replica  vectors,  the 
selection  of  according  to  (3.6)  will  be  biased  towards  the  minimum  norm  area  of 
the  convex  hull.  This  is  verified  numerically  in  Subsection  3.2.1. 

3.1.4  The  Range  and  Sampling  of  the  Environmental  Pa¬ 
rameter  Set  $ 

In  Section  2.3,  a  method  of  solving  the  minmax  problem  in  Step  1  of  the  algorithm 
was  developed  which  required  the  range  of  environmental  parameters  over  which  the 
processor  is  designed  to  operate  to  be  sampled  to  yield  a  finite  set  $  = 

The  interpretation  of  Step  1  of  the  processor  as  an  adaptive  signal  replica  calculator 


Ui))  = 


(si,  QH!,^Q(l,^St,)^ 


which  selects  €  W  where  Q(f,z)  =  1 1  =  1, . . .  provides 

some  insight  into  the  desired  location  of  the  samples  of  the  operating  range  which  are 
included  in  the  finite  set  Using  the  term  extreme  point  to  refer  to  any  point  in 
a  set  which  is  not  representable  as  the  convex  combination  of  two  other  points  in  the 
set,  a  convex  hull  is  completely  defined  by  its  extreme  points.  Therefore,  nothing  is 
gained  by  including  a  point  in  $  for  which  the  corresponding  replica  vector  is  not  an 
extreme  point  of  H  Referring  to  Figure  3-2,  £(^i),  £(^2)*  and  £(^3)  axe 

the  extreme  points  of  the  convex  hull. 

Referring  to  Figure  3-3,  suppose  that  the  operating  range  of  the  processor  includes 
the  continuous  set  of  points  whose  replica  vectors  fill  the  continuous  arc  on  the  circle 
between  £(^i)  and  £(^4).  Then,  the  convex  hull  of  these  replica  vectors  includes 
the  region  bounded  by  the  arc  between  £(<^i)  and  £(^4)  and  the  chord  subtended 
by  that  arc  (i.e.,  the  straight  line  connecting  £(^i)  and  £(^4)).  This  convex  hull 
can  be  approximated  closely  by  including  several  environmental  points  in  $  whose 
corresponding  replica  vectors  are  spaced  evenly  along  the  arc  between  £(^i)  and  £(<^4) 

and  ^3  in  the  example  shown).  However,  with  further  analysis,  it  can  be  seen 
that  this  step  is  unnecessary. 

Any  point  in  the  convex  hull  shown  in  Figure  3-3  which  does  not  fall  on  the  chord 
connecting  £(^i)  and  £(^4)  can  be  expressed  as  a  point  on  that  chord  multiplied 
by  a  real  number  greater  than  one.  Therefore,  referring  to  (3.6),  the  term  in  the 
brackets  will  be  the  same  for  both  the  point  not  on  the  chord  and  the  corresponding 
point  on  the  chord.  However,  the  norm-squared  term  will  be  smaller  for  the  point  on 
the  chord.  Therefore,  £^^^  will  always  lie  on  the  chord  connecting  £(^i)  and  £(^4). 
The  solution  from  within  the  convex  hull  of  the  continuous  set  of  replica  vectors  will 
therefore  always  fall  within  fi  ({£(^1),  £(^4)})-  Thus,  no  performance  will  be  lost  if 

This  analysis  can  be  extended  to  higher  dimension  replica  vectors  to  determine  the 
number  and  the  location  of  the  samples  of  the  continuous  environmental  parameter  set 
required  to  insure  that  the  sampling  process  does  not  result  in  a  loss  in  performance. 
The  requirement  which  must  be  met  is  that  the  replica  vectors  at  the  selected  samples 
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should  define  a  convex  hull  which  is  a  subset  of  the  convex  hull  of  the  continuous  set  of 
replica  vectors  and  which  contains  a  set  of  points  which  is  guaranteed  to  contain 
(such  as  the  chord  in  the  two-dimensional  example  above).  One  problem  which  may 
occur  in  higher  dimensions  but  does  not  occur  in  two-dimensions  is  that  the  required 
convex  hull  may  have  an  infinite  number  of  extreme  points.  Therefore,  it  can  only 
be  approximated  by  the  convex  hull  of  a  finite  set  of  replica  vectors.  If  the  actual 
replica  vector  falls  outside  the  approximating  convex  hull,  then  mismatch  between 
and  will  occur.  The  resulting  MVDR  processor  will  tend  to  reject  (filter 
out)  the  desired  signal  and  will  suffer  the  same  performance  degr ablation  problems 
which  are  characteristic  of  the  MVDR  Matched  Field  Processor  in  the  presence  of 
environmental  mismatch  [13, 17].  The  significance  of  this  degradation  will  depend  on 
the  extent  to  which  g^^  falls  outside  of  “W  ((J(/,i)).  Numerical  results  illustrating 
the  dependence  of  processor  performance  on  the  sampling  density  of  the  set  $  and 
the  performance  degradation  which  results  when  g^^  falls  outside  of  "H  i))  are 
presented  in  Subsection  3.2.2. 

The  performance  degradaton  which  will  occur  if  g^^  falls  outside  of  7i  (<?(/,  l)) 
highlights  a  conflicting  requirement  in  selecting  the  range  of  the  environmental  pa¬ 
rameters  over  which  the  processor  is  designed  to  operate.  As  described  in  Subsec¬ 
tion  3.1.2,  the  processor  will  treat  any  portion  of  the  received  signal  which  falls  within 
any  rank  one  subspace  spanned  by  a  replica  vector  in  7i  ((?(/,  s))  as  desired  signal, 
and  will  select  g^j^  to  pass  as  much  of  this  signal  as  possible  through  the  resulting 
spatial  filter.  If  $  is  large,  there  will  tend  to  be  a  wide  range  of  rank  one  subspaces 
spanned  by  the  replica  vectors  in  Ti  (<j(/ji))-  In  this  case,  the  processor  may  select 
g^jj  to  lie  in  a  noisy  subspace  which  contains  very  little  of  the  power  of  the  desired 
signal;  and  the  spatial  Alter  will  pass  a  lot  of  noise  and  little  of  the  desired  signal.  If 
$  is  small,  then  H  (Q(/,i))  will  tend  to  be  small  and  the  probability  that  the  actual 
replica  of  the  desired  signal  will  not  fall  within  W  (Q(/,£))  will  increase.  In  this 
case,  mismatch  will  occur  and,  as  stated  in  the  preceding  paragraph,  the  tendency  of 
the  processor  will  be  to  reject  (Alter  out)  the  desired  signal  resulting  in  performance 
degradation.  Therefore,  it  is  important  to  balance  the  selection  of  a  large  operating 
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range  to  avoid  filtering  out  the  desired  signal  with  the  selection  of  a  small  operating 
range  to  avoid  passing  a  lot  of  noise  through  the  filter. 

The  tradeoff  in  the  selection  of  $  between  filtering  out  as  much  noise  as  possible 
and  avoiding  the  rejection  of  the  desired  signal  can  be  analyzed  using  the  probability 
of  false  alarm  (Pp)  a^d  probability  of  detection  (Pd)  performance  measures.  Assume 
that  a  fourth  stage  is  added  to  the  Adaptive  Minmax  Processor  to  perform  a  threshold 
hypothesis  test  to  estimate  whether  or  not  a  point  source  is  present  at  the  array  focal 
point.  That  is,  letting  I/q  be  the  hypothesis  that  there  is  no  point  source  at  the  array 
focal  point  (i.e.,  the  received  signal  consists  of  just  noise)  and  Hi  be  the  hypothesis 
that  there  is  a  point  source  at  the  array  focal  point  (i.e.,  the  received  signal  consists  of 
the  desired  signal  plus  noise),  the  hypothesis  test  selects  between  the  two  hypotheses 
using  the  decision  rule 


^0  :  ^*(/,«)  <  V 
Hi  :  ^^(/,2)  >  »? 


where  rj  is  the  decision  threshold.  Pf  is  defined  as  the  probability  that  the  test  will 
select  Hi  when  no  point  source  is  present  at  the  zurray  focal  point  and  Pd  is  defined 
ais  the  probability  that  the  test  will  select  Hi  when  there  is  a  point  source  present  at 
the  array  focal  point. 

is  calculated  with  the  following  two  steps: 


S.,;  =  "8  .,2  S(/)-  1 


and 


= 


Let  Sn  be  the  noise  cross-spectral  covariance  matrix  and  Sd«(^)  be  the  cross-spectral 
covariance  matrix  of  the  desired  signal  given  the  environmental  conditions  <i>.  Then, 
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Pp  and  Pd  can  be  written  as 


{ >  ^V»-y  >  ’  I 

\t„  S(/)  ’  %„  / 

=  P”**  1  >  —  >  ’>  l®(^)  =  S„  +  S4^)) 

\S.«  S(/)-*  j,..  ] 


£  I  =  1  V£  €  Q{f,z)  implies  that  |  £  |  <  1  V£  €  W  (Q(/,z)).  Therefore,  defining 


2 

mtn 


min 


i^niQif^z)) 


the  following  relationship  holds 

I  1  Imi.  ^  illi>II  <  _ ! _ 

tn  S(/)-  S.//  ■  s.‘//  S(/)-  3.,,  -  tn  S(/)-  s.//' 

Therefore,  Pp  and  Pd  can  be  bounded  by 


Pf  <  Pf..  =  Prob(3£e  «((?(/,!))  I  S(/)  =  S.) 

Pf  >  Pfi,  =  Prob(3j€H(e(/,r))s.t.s‘S(/r’2  I  S(/)  =  S„) 

Pd  <  Pd..  =  Prob  (32  eW  («(/,£))  <  V'  |  S(/)  =  S„  +  Sj.(b)) 

Pd  >  PD,.  =  Prob(32€W((?(/,i))s.t.2*S(/)-'2  <  1 2  IL»  b*'  I  S(/)  =  S„  +  Sj,(^)) 


*,  «(/,!),  1 2  |L„,  Pf..,  Pf,.,  Pd..,  Pd,,  and  (?'(/,  r),  1 2  |L,  Pf^.  Pf„. 

Pb,.  be  two  sets  of  operating  ranges  of  environmental  parameters,  replica  vec¬ 
tors,  and  associated  bounds.  Assume  that  $  C  Then  ‘7<(Q(/,i))  C  "W  ((?'(/, i)) 
and  I  £  Therefore,  Pf.*  <  Pf.*  and  Pp.^  <  Pf^^.  However,  since 

I  £  Imm^l  £  Imirn  Same  conclusion  cannot  be  drawn  about  the  corresponding  lower 
bounds.  Therefore,  as  the  operating  range  of  environmental  parameters  is  increased, 
the  upper  bounds  on  the  probabilities  of  false  sJarm  and  detection  are  increaised. 
However,  as  stated  at  the  end  of  Subsection  3.1.3,  if  $  is  increased  to  the  point  that 
I  £  Imin  becomes  small,  the  mismatch  between  £^^^  and  £^^  will  tend  to  increase  when 
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the  SNR  is  low.  This  increased  mismatch  will  result  in  a  reduction  in  ambiguity 
function  of  the  processor  at  the  source  location  and  therefore  reduce  the  probability 
of  detection.  This  analysis  is  supported  by  the  numerical  results  shown  in  Figure  3-16 
in  Subsection  3.2.1.  Therefore,  at  low  SNRs,  the  probability  of  detection  is  not  likely 
to  increase  as  much  as  would  be  inferred  from  the  increase  in  as  $  is  increased. 


3.2  Numerical  Analysis  of  the  Adaptive  Minmax 
Matched  Field  Processor 

The  analysis  of  many  characteristics  of  the  Adaptive  Minmax  Matched  Field  Pro¬ 
cessor  is  improved  considerably  by  the  inclusion  of  numerical  results.  A  principal 
reason  for  this  is  the  lack  of  illustrative  anzdyticad  solutions  to  Step  1  of  the  array 
processing  algorithm.  Several  numerical  examples  are  included  here  to  assist  in  the 
analysis  of  different  characteristics  of  the  algorithm.  Subsection  3.2.1  presents  results 
characterizing  the  performance  of  the  algorithm  in  a  deterministic  ideal  waveguide. 
In  this  case,  the  uncertain  environmental  characteristic  is  the  depth  of  the  waveguide. 
Subsection  3.2.2  presents  results  which  characterize  the  performance  of  the  algorithm 
in  a  deterministic  horizontally-stratified  ocean  with  an  arctic  sound  speed  profile. 
Here,  the  uncertdn  environmental  characteristic  is  the  sound  speed  profile  in  the 
surface  layer  of  the  ocean.  Finally,  Subsection  3.2.3  presents  results  describing  the 
performance  of  the  algorithm  in  a  randomly  time-variant  ideal  waveguide  where,  as 
before,  the  uncertain  environmental  characteristic  is  the  depth  of  the  waveguide.  In 
this  case,  the  random  perturbations  to  the  ocean  are  perturbations  to  the  sound  speed 
structure  as  presented  in  Chapter  4.  These  results  are  presented  to  allow  comparison 
of  the  performance  of  the  processor  in  deterministic  and  random  media. 

In  all  the  numerical  simulations,  an  important  parameter  is  the  signal  to  noise 
ratio.  For  the  purpose  of  these  simulations,  the  signal  to  noise  ratio  is  defined  as 
the  ratio  of  the  average  power  per  array  sensor  in  the  desired  signal  to  the  average 
power  per  array  sensor  in  the  spatially  white  sensor  noise.  For  the  case  where  there  is 
propagating  background  noise,  the  background  noise  to  sensor  noise  ratio  is  similarly 
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defined.  Thus,  if  the  SNR  is  20  dB  and  the  background  noise  to  sensor  noise  ratio  is 
10  dB,  then  the  signal  to  background  noise  SNR  is  10  dB.  In  all  cases,  the  average 
power  measurements  are  taken  after  the  temporal  processing  by  the  discrete-time 
Fourier  transform. 

None  of  the  results  presented  were  generated  with  Monte  Carlo  simulations.  In 
all  cases,  it  is  assumed  that  the  processor  has  perfect  knowledge  of  the  cross-spectral 
correlation  matrix.  That  is,  S(/)  =  S(/).  All  units  of  distance  measurement  used  in 
this  section  are  metric. 

3.2.1  The  Deterministic  Ideal  Waveguide 

The  first  set  of  numerical  results  were  generated  using  an  ideal  waveguide  model 
of  the  ocean  and  a  normal  mode  representation  of  the  propagating  signals.  (See 
Section  4.3  for  an  introduction  to  the  normal  mode  representation  of  signals).  In 
addition,  the  ocean  is  assumed  to  be  a  deterministic  (i.e.,  time-invariant)  medium. 
The  salient  parameters  of  the  simulation  are  that  the  ocean  is  assumed  to  be  an 
isovelocity  waveguide  with  a  sound  speed  of  V{z)  —  1500  m/s  where  U(z)  is  the 
time-invariant  depth  dependent  component  of  the  sound  speed  defined  in  Chapter  4, 
and  an  unknown  but  constant  depth  H  between  290  meters  and  310  meters.  The 
sea  surface  is  assumed  to  be  a  free  surface  and  the  ocean  bottom  is  2issumed  to  be 
infinitely  rigid.  The  source  is  assumed  to  be  a  deterministic  complex  exponential 
with  a  frequency  of  20  Hz.  In  addition,  the  source  is  assumed  to  be  sufficiently  far 
from  the  axray  so  that  the  horizontal  propagation  of  the  modes  can  be  modeled  by  a 
complex  exponential  rather  than  a  Hzuikel  function. 

With  these  assumptions,  the  waveguide  supports  eight  propagating  modes.  The 
mode  shapes  are  a  function  of  depth  z  and  given  by 

4>n{z)  =  sm{kv„z)  n  =  1,...,8, 

where  kv„  =  vertical  wavenumber  of  the  n‘^  mode.  The  mode  shape 

<f>n{2)  should  not  be  confused  with  the  environmental  parameter  The  propagating 


signal  at  the  range  r,  depth  z,  and  time  t  is  given  by  (4.12) 

x{r,z,t)  =  ^  sin{kv„z,ource)s\n{kv„z), 

n=sl 

where  kH„  =  \/{k^  —  ^v„)  is  the  horizontal  wavenumber  of  the  mode  tind  k  =  ^ 
is  the  wavenumber  of  the  signal. 

The  receiving  array  is  a  nine-element  vertical  array  with  the  top  element  at  30 
meters  depth  and  the  bottom  element  at  270  meters  depth.  The  inter-element  spacing 
is  therefore  30  meters.  For  all  the  examples  in  this  subsection,  the  sources  are  assumed 
to  be  at  a  range  of  50  km  from  the  array  and  there  is  assumed  to  be  no  propagating 
background  noise  (i.e.,  the  received  signal  consists  of  spatially  white  sensor  noise 
and  narrowband  signals  emitted  by  point  sources).  In  addition,  in  this  section,  the 
localization  problem  is  assumed  to  be  a  one-dimensional  (depth  only)  problem.  That 
is,  it  is  assumed  that  the  true  range  to  the  source(s)  is  known  and  the  processor  is 
trying  to  determine  the  source  depth(s).  These  assumptions  are  made  to  simplify 
the  presentation  of  the  results  and  the  analysis  of  the  salient  characteristics  of  the 
processor. 

For  the  first  results  in  this  subsection,  a  single  source  is  placed  at  a  depth  of 
150  meters  with  an  SNR  of  10  dB.  The  purpose  of  these  results  is  to  illustrate  the 
effect  which  the  ocean  depth  uncertainty  has  on  the  MVDR  Matched  Field  Processor, 
the  Bartlett  Matched  Field  Processor,  and  the  Adaptive  Minmax  Matched  Field 
Processor,  and  to  provide  a  qualitative  comparison  of  the  resolution  of  each  processor. 
Figures  3-4a  through  3-8a  show  the  depth  2unbiguity  functions  generated  by  each 
processor  for  the  cases  where  the  actual  ocean  depth  is  290  meters;  and  Figures  3-4b 
through  3-8b  show  the  depth  ambiguity  functions  generated  by  each  processor  for  the 
cases  where  the  actual  ocean  depth  is  310  meters. 

The  results  in  Figure  3-4  are  generated  using  a  MVDR  Processor  assuming  that  the 
ocean  depth  is  290  meters.  When  the  ocean  depth  assumption  is  accurate  (Figure  3- 
4a),  the  processor  generates  a  sharp  peak  in  its  ambiguity  function  at  the  true  source 
depth;  when  the  depth  assumption  is  inaccurate  (Figure  3-4b),  the  peak  level  of  the 
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ambiguity  function  drops  by  approximately  17  dB  and  there  is  no  single  significant 
peak.  The  sidelobe  suppression  of  the  perfectly  matched  processor  (approximatly 
18  dB  below  the  main  peak  level)  is  good  and  the  peak  in  the  ambiguity  function 
is  sharp.  Figure  3-5  shows  the  complementary  results  generated  using  a  MVDR 
Processor  assuming  that  the  ocean  depth  is  310  meters. 

The  results  in  Figure  3-6  are  generated  using  a  Bartlett  Processor  which  assume 
that  the  ocean  depth  is  290  meters.  As  was  the  case  with  the  MVDR  Processor, 
the  perfectly  matched  Bartlett  Processor  (Figure  3-6a)  generates  a  single  significant 
peak  at  the  true  source  depth;  the  «imbiguity  function  generated  by  the  mismatched 
Bartlett  Processor  (Figure  3-6b)  has  no  single  significant  peak.  However,  there  are 
three  significant  differences  in  the  ambiguity  functions  generated  by  the  processors. 
First,  the  sidelobe  suppression  of  the  Bartlett  Processor  is  not  nearly  as  good  as  that 
of  the  MVDR  Processor  with  the  peak  sidelobe  level  for  the  perfectly  matched  Bartlett 
Processor  only  7  dB  below  the  main  peak  level  (compared  to  18  dB  for  the  MVDR 
Processor).  Second,  the  mainlobe  width  for  the  perfectly  matched  Bartlett  Processor 
is  much  wider  than  that  generated  by  the  MVDR  Processor.  Finally,  the  mismatched 
Bartlett  Processor  does  not  experience  the  significant  loss  in  the  peak  level  of  the 
ambiguity  function  as  was  experienced  by  the  mismatched  MVDR  Processor.  These 
ambiguity  function  peaks,  generated  by  the  mismatched  processors,  are  sidelobes  in 
the  source  depth/ocean  depth  plane.  Thus,  the  low  peak  levels  in  the  ambiguity 
function  of  the  mismatched  MVDR  Processor  are  characteristic  of  the  low  sidelobe 
levels  of  MVDR  Processor  and  the  high  peak  levels  in  the  ambiguity  function  of 
the  mismatched  Bartlett  Processor  are  chMacteristic  of  the  high  sidelobe  levels  of 
the  Bartlett  Processor.  Figure  3-7  shows  the  complementary  results  generated  by  a 
Bartlett  Processor  which  assumes  that  the  ocean  depth  is  310  meters. 

Finally,  the  results  in  Figure  3-8  are  generated  using  the  Adaptive  Minmax  Pro¬ 
cessor.  In  this  case,  the  processor  assumes  that  the  ocean  depth  may  be  between 
290  and  310  meters.  With  only  this  approximate  knowledge  of  the  ocean  depth,  the 
ambiguity  function  generated  by  the  processor  shows  a  single  significant  peak  at  the 
true  source  depth  for  the  case  where  the  ocean  depth  is  290  meters  and  the  case  where 
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the  ocean  depth  is  310  meters.  The  sidelobe  suppression  (approximately  15  dB)  is 
comparable  to  that  of  the  perfectly  matched  MVDR  Processor  and  is  significantly 
better  than  that  of  the  perfectly  matched  Bartlett  Processor.  The  same  observation 
can  be  made  about  the  sharpness  of  the  mainlobe.  This  final  observation  leads  to  the 
qualitative  conclusion  that  the  resolution  of  the  Adaptive  Minmax  Processor  is  com¬ 
parable  to  that  of  the  perfectly  matched  MVDR  Processor  and  significantly  better 
than  them  of  the  perfectly  matched  Bartlett  Processor.  This  conclusion  is  supported 
quantitatively  by  results  contained  later  in  this  subsection.. 

Figure  3-9  shows  the  ambiguity  functions  of  the  perfectly  matched  MVDR  and 
Bartlett  Processors  and  the  Adaptive  Minmax  Processor  for  the  Ceise  where  the  actual 
ocean  depth  is  310  meters,  the  source  is  located  at  150  meters  depth,  and  the  SNR  is 
0  dB.  These  results  are  presented  to  illustrate  that,  in  general,  the  ambiguity  functions 
do  not  change  qualitatively  when  the  SNR  is  lowereed  to  0  dB.  One  significant  change 
is  that  the  peak  level  of  the  ambiguity  function  of  the  Adaptive  Minmax  Processor  is 
no  longer  equal  to  that  of  the  perfectly  matched  MVDR  and  Bartlett  Processors  as 
is  the  case  when  the  SNR  is  10  dB.  The  peak  level  in  this  case  has  dropped  to  about 
2.5  dB  below  that  of  the  MVDR  and  Bartlett  Processors.  This  loss  in  peak  level  will 
be  analyzed  later  in  this  subsection. 

As  demonstrated  with  the  preceding  results,  the  response  of  the  Adaptive  Minmax 
Processor  in  the  environmental  parameter  space  (in  this  case  ocean  depth)  is  consid¬ 
erably  broader  than  that  of  the  MVDR  Processor.  That  is,  the  Adaptive  Minmax 
Processor  will  detect  signals  from  a  wider  remge  of  environmental  parameters  than 
the  MVDR  Processor.  Qualitatively,  the  signal  replica  vector  is  a  relatively  smooth 
function  of  both  the  source  location  and  the  environmental  conditions.  Therefore, 
since  both  processors  characterize  signals  by  their  replica  vectors,  it  is  reasonable  to 
conclude  that  the  response  of  the  Adaptive  Minmax  Processor  in  the  source  location 
space  would  be  broader  than  that  of  the  MVDR  Processor.  The  Multiple  Constraints 
Matched  Field  Processor  [17]  makes  use  of  this  fact  and  broadens  the  response  of 
the  processor  in  the  environmental  parameter  space  by  intentionally  broadening  the 
response  of  the  processor  in  the  source  location  space.  The  result  is  a  reduction  in 
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(a)  Actual  Ocean  Depth  =  290  meters 


(b)  Actual  Ocean  Depth  =  310  meters 


Figure  3-4:  Ambiguity  Function  for  the  MVDR  Processor:  SNR  =  10  dB 
Assumed  Ocean  Depth  =  290  meters 
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(a)  Actual  Ocean  Depth  =  290  meters 


0  50  100  150  200  250  300 


DEPTH  (Mtm) 

(b)  Actual  Ocean  Depth  =  310  meters 


Figure  3-5:  Ambiguity  Function  for  the  MVDR  Processor:  SNR  =  10  dB 
Assumed  Ocean  Depth  =  310  meters 
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(a)  Perfectly  Matched  MVDR  Processor 


(b)  Perfectly  Matched  Bartlett  Processor 


(c)  Adaptive  Minmax  Processor 


Figure  3-9:  Ambiguity  Functions  for  SNR  =  0  dB  and  Ocean  Depth  =  310  meters 
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the  sensitivity  of  the  processor  to  environmental  mismatch  and  a  corresponding  re¬ 
duction  in  the  spatiaJ  resolution  of  the  processor.  Therefore,  an  issue  of  concern  is 
how  much  spatial  resolution  must  be  sacrificed  to  create  the  broad  response  of  the 
Adaptive  Minmaoc  Processor  in  the  environmental  parameter  space. 

The  next  results  axe  presented  to  allow  quantitative  comparison  of  the  resolution  of 
the  perfectly  matched  MVDR  Processor,  the  perfectly  matched  B«irtlett  Processor, 
and  the  Adaptive  Minmax  Processor.  The  actual  ocean  depth  for  this  simulation 
is  310  meters.  Two  equal  strength  sources,  each  with  a  10  dB  SNR,  are  located 
at  a  depth  of  155  ±  ^depth  meters  where  2Adepth  is  the  depth  separation  between 
the  sources.  Characteristic  ambiguity  functions  generated  by  the  Adaptive  Minmax 
Processor  for  this  two-source  case  are  shown  in  Figure  3-10.  In  Figure  3- 10a,  at  a 
source  separation  is  8  meters,  the  processor  is  unable  to  resolve  the  two  sources.  In 
Figure  3- 10b,  at  a  source  separation  is  24  meters,  the  processor  is  able  to  resolve  the 
sources. 

As  a  measure  of  a  processor’s  ability  to  resolve  sources,  the  response  ratio,  defined 
as  lOlogjo  (l?),  is  used.  For  the  case  shown  in  Figure  3-lOa,  where  there  is  a  single 
peak  in  the  ambiguity  function  between  the  two  source  locations;  is  the  peak 
value  of  the  ambiguity  function  between  the  two  source  locations.  For  the  case  shown 
in  Figure  3- 10b,  where  the  processor  resolves  two  distinct  peaks  in  the  ambiguity 
function;  is  the  minimal  value  of  the  ambiguity  function  between  the  two  source 
locations.  For  the  case  shown  in  Figure  3- 10a,  where  there  is  a  single  peak  in  the 
ambiguity  function  between  the  two  source  locations;  a]  is  the  smaller  of  the  values 
of  the  ambiguity  function  evaluated  at  the  two  source  locations.  For  the  C2ise  shown 
in  Figure  3-lOb,  where  the  processor  resolves  two  distinct  peaks  in  the  ambiguity 
function;  is  the  smaller  of  the  two  peak  values.  Therefore,  if  the  response  ratio  is 
greater  than  zero,  the  processor  is  unable  to  resolve  septate  peaks  in  the  ambiguity 
function.  If  the  response  ratio  is  less  than  zero,  the  processor  resolves  two  distinct 
peaks  in  the  ambiguity  function  and  the  depth  of  the  dip  in  the  ambiguity  function 
between  the  peaks  equals  the  response  ratio  in  dB.  Thus,  the  lower  the  response  ratio 
at  any  source  separation,  the  better  the  ability  of  the  processor  to  resolve  sources 
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with  that  separation. 

Referring  to  Figure  3-11,  the  MVDR  and  Adaptive  Minmax  Processors  are  able 
to  resolve  separate  peaks  for  source  separations  greater  than  17  and  18  meters,  re¬ 
spectively,  and  the  Bartlett  Processor  requires  approximately  28  meters  separation 
in  order  to  be  able  to  resolve  separate  peaks.  The  depth  of  the  dip  between  source 
peaks  is  between  .1  and  2  dB  less  for  the  Adaptive  Minmax  Processor  than  it  is  for 
the  MVDR  Processor.  The  depth  of  the  dip  for  the  Bartlett  Processor  is  approxi¬ 
mately  7  dB  less  than  it  is  for  the  Adaptive  Minmax  and  MVDR  Processors.  Thus, 
the  resolution  of  the  Adaptive  Minmax  Processor  is  slightly  less  than  than  that  of 
the  perfectly  matched  MVDR  Processor  but  it  is  significantly  greater  than  that  of 
the  perfectly  matched  Bartlett  Processor. 

The  preceding  results  provide  a  means  of  assessing  the  capabilities  of  the  Adap¬ 
tive  Minmax  Processor.  The  following  two  sets  of  results  illustrate  characteristics  of 
the  processor  in  order  to  better  understand  how  the  processor  functions.  The  second 
stage  of  the  processor  (Figure  2-4)  is  a  linear  spatial  filter,  the  coefficients  of  which 
are  calculated  in  Step  3  of  the  array  processing  algorithm.  For  each  array  focal  point, 
a  different  set  of  weights  are  calculated.  Just  as  a  frequecy  response  is  defined  for  any 
set  of  weights  of  a  linear  temporal  filter,  a  source  location/environmental  condition 
response  can  be  defined  for  each  set  of  weights  of  the  linear  spatial  filter.  In  the 
ideal  waveguide  example,  where  source  location  is  parameterized  by  the  source  depth 
and  the  environmental  conditions  are  parameterized  by  the  ocean  depth,  the  de¬ 
sired  response  is  the  source  depth/ocean  depth  response.  Letting  be  the 

snapshot  of  a  signal  emitted  by  a  source  at  the  depth  z  given  the  ocean  depth  H,  the 
source  depth/ocean  depth  response  for  the  weights  m  is  given  by  W{z,  H)  =  ■ 

This  magnitude  response  is  the  gain  which  the  spatial  filter  applies  to  a  signal  emitted 
by  a  source  at  depth  z  and  which  propagates  through  an  ocean  with  depth  H. 

For  the  results  shown  in  Figures  3-12  through  3-15,  a  single  source  is  located  at  250 
meters  depth  with  an  SNR  of  20  dB.  Figure  3-12  shows  the  source  depth/ocean  depth 
response  of  the  array  weights  generated  by  the  Adaptive  Minmax  Processor  when  the 
array  focal  point  is  175  meters  depth  and  the  actual  ocean  depth  is  290  meters.  The 
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t®PTH  (melert) 


(a)  Source  Depths  =  151  and  159  meters 


(b)  Source  Depths  =143  and  167  meters 


Figure  3-10:  Ambiguity  Functions  for  the  Minmax  Processor:  Two  Sources 
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Figure  3-11:  Resolution  Comparison  for  Different  Processors 

magnitude  response  evaluated  at  an  ocean  depth  of  290  meters  (Figure  3-12a)  has  a 
sharp  null  at  the  source  depth  of  250  meters.  This  is  consistent  with  the  processor 
attempting  to  null  the  signal  emitted  by  a  source  at  the  depth  of  250  meters  which 
propagates  through  an  ocean  of  depth  290  meters.  The  magnitude  response  evaluated 
at  an  ocean  depth  of  310  meters  (Figure  3-12b)  has  no  sharp  null  at  the  source  depth  of 
250  meters.  This  is  consistent  with  the  fact  that  the  received  signal  has  no  component 
which  propagated  through  an  ocean  for  which  depth  was  310  meters.  Hence,  there  is 
no  signal  with  a  replica  vector  characteristic  of  this  point  (250  meters  source  depth, 
310  meters  ocean  depth)  to  cancel.  Figure  3-13  shows  a  complementary  set  of  results 
for  the  case  where  the  actud  ocean  depth  is  310  meters.  Here,  the  sharp  null  appears 
in  the  response  evaluated  at  an  ocean  depth  of  310  meters  but  does  not  appear  in  the 
response  evaluated  at  an  ocean  depth  of  290  meters. 

Figure  3-14  shows  the  source  depth/ocean  depth  response  of  the  array  weights 
generated  by  the  Adaptive  Minmax  Processor  when  the  array  focal  point  is  250  meters 
and  the  actual  ocean  depth  is  290  meters.  The  response  evaluated  at  an  ocean  depth 
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of  290  meters  (Figure  3- 14a)  is  close  to  one  at  the  source  depth  of  250  meters.  This  is 
consistent  with  the  interpretation  of  the  spatial  filter  as  a  Two-Stage  MVDR  Processor 
where  the  filter  response  at  the  point  corresponding  to  the  effective  replica  vector 
equals  one,  and  is  also  constitent  with  the  conclusion  in  Subsection  3.1.2  that  the 
effective  replica  vector  will  be  very  close  to  the  actual  replica  vector  at  high  SNRs. 

The  response  evaluated  at  an  ocean  depth  of  310  meters  (Figure  3-14b)  is  not  close 
to  one  which  is  also  consistent  with  the  above  interpretation.  Since  the  received  signal 
contains  no  component  which  propagated  through  an  ocean  of  depth  310  meters,  the 
signal  power  which  the  processor  detects  at  this  point  is  small.  Thus  is  not 
forced  to  be  near  the  replica  vector  for  this  point,  and  the  response  at  this  point  is 
not  constrained  to  be  close  to  one.  Figure  3-15  shows  a  complementary  set  of  results 
for  the  case  where  the  actual  ocean  depth  is  310  meters. 

The  final  results  in  this  subsection  allow  an  assessment  of  how  the  difference  in  the 
norms  of  the  replicas  contained  in  Ti  ((?(/>  l))  can  adversely  affect  the  performance 
of  the  Adaptive  Minmax  Processor  at  low  SNRs.  These  results  were  generated  with  a 
source  at  250  meters  depth  and  an  actual  ocean  depth  of  310  meters.  The  focal  point 
of  the  Adaptive  Minmax  Processor  is  250  meters  depth.  For  this  particular  case, 
where  the  noise  consists  of  only  spatially  white  sensor  noise,  (3.6)  can  be  rewritten 
as 


2.//  =  arg 


min 


£€«(<?(/,!)) 


A'ft  +  l 


As  the  SNR  decreeises,  the  first  term  in  the  parenthesis  increases,  the  reduction  of  the 
norm  of  is  emph«isized,  and  the  reduction  of  the  mismatch  between  and 
(i.e.  increasing  cos^(£_^^,2^^^;  I))  is  deemphasized.  Therefore,  cos^(^^^,£^^^;  I)  should 
decrease  as  the  SNR  decrecises.  Figure  3-16a  supports  this  conclusion. 

Figures  3-14a  and  3-15b  show  that,  at  a  high  SNR,  the  source  depth/ocean  depth 
response  of  the  weights  generated  by  the  Adaptive  Minmax  Processor  equals  one 
when  the  array  focal  point  is  the  source  location  and  the  response  is  evaluated  at 
the  actual  source  depth  and  the  actual  ocean  depth.  However,  when  the  SNR  is 
decreased.  Figure  3-16b  show's  that  the  response  evaluated  at  the  actual  source  depth 
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1.6 


(a)  Response  at  Ocean  Depth  =  290  meters 


(b)  Response  at  Ocean  Depth  =  310  meters 

Figure  3-14:  Source  Depth/Ocean  Depth  Response  for  Array  Focal  Point  =  250 
meters  and  Actual  Ocean  Depth  =  290  meters 
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SOURCE  DEPTH  (meter*) 

(a)  Response  at  Ocean  Depth  =  290  meters 


SOURCE  DEPTH  (meim) 

(b)  Response  at  Ocean  Depth  =  310  meters 


Figure  3-15:  Source  Depth/Ocean  Depth  Response  for  Array  Focal  Point 
meters  and  Actual  Ocean  Depth  =  310  meters 
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and  actual  ocean  depth  drops  below  one.  This  drop  can  be  partially  attributed  to 
the  increase  in  processor  mismatch  as  measured  by  the  difference  between  and 
2^^,  which  results  from  a  decrease  in  SNR;  this  is  illustrated  in  Figure  3-16a.  The 
increased  mismatch  and  resulting  drop  in  the  magnitude  response  partially  accounts 
for  the  drop  in  the  peak  level  of  the  ambiguity  function  of  the  Adaptive  Minmax 
Processor  which  was  discussed  earlier  and  illustrated  in  Figure  3-9. 

3.2.2  The  Arctic  Ocean 

The  second  set  of  numerical  results  were  generated  modeling  the  ocean  as  a  deter¬ 
ministic  horizontally-stratified  medium  with  the  arctic  sound  speed  profile  shown  in 
Figure  3-17.  The  dominant  characteristic  of  the  sound  speed  profile  is  the  strong 
surface  duct  created  as  a  result  of  the  location  of  the  sound  speed  minimum  at  the 
surface.  Propagating  sound  will  be  refracted  to  the  surface  and  tend  to  stay  in  the 
duct  close  to  the  surface  as  it  propagates.  The  sound  speed  profile  at  depths  greater 
than  or  equal  to  85  meters  is  assumed  to  be  known  to  the  processor.  The  sound  speed 
in  the  top  85  meters  of  the  water  column  is  assumed  to  vary  in  a  pseudo-linear  fash¬ 
ion  (i.e.,  — ^  varies  linearly  with  depth)  between  an  unknown  value  at  the  surface 
and  a  known  value  at  the  depth  of  85  meters.  Therefore,  the  sound  speed  profile  is 
completely  parameterized  by  the  surface  sound  speed  ^(0)  and  $  is  the  set  of  pos¬ 
sible  values  for  ^(0).  The  salient  parameters  of  the  simulation  are  that  the  ocean 
depth  is  3800  meters,  the  sea  surface  is  assumed  to  be  a  free  surface,  and  the  ocean 
bottom  is  assumed  to  be  a  soft  bottom  so  that  no  bottom  reflected  waves  zu'e  able 
to  reach  the  array  sensors.  The  array  is  an  18  element  vertical  array  with  the  top 
element  at  a  depth  of  60  meters  and  the  bottom  element  at  a  depth  of  570  meters 
(inter-element  spacing  is  30  meters).  All  of  the  results  shown  here  were  generated 
with  a  single  omnidirectional  20  Hz  source  located  at  a  depth  of  190  meters  and  a 
Tange  of  250  km  from  the  array.  In  all  cases,  the  range  of  the  surface  sound  speed 
given  to  the  Adaptive  Minmax  Processor  ($)  is  from  ^(0)  =  1430.75  meters/second 
to  ^(0)  =  1432.25  meters/second. 

The  noise  field  consists  of  both  sensor  noise  and  surface-generated  background 


86 


1 


(a)  cos2(£^^^,£^^^;I)  vs  SNR 


(b)  Source  Depth/Ocean  Depth  Response  vs  SNR 


Figure  3-16:  Adaptive  Minmax  Processor  Charau:teristics  vs  SNR 
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Figure  3-17:  Arctic  Sound  Speed  Profile 

noise.  Consistent  with  the  surface  noise  model  in  [35],  the  surface  noise  is  modeled 
as  generated  by  a  horizontal  sheet  of  stochastic  monopole  sources  which  are  spatially 
uncorrelated  and  located  0.5  meters  below  the  surface. 

Figures  3-18  through  3-21  show  the  ambiguity  functions  for  the  M  tched  and  the 
Mismatched  MVDR  Processors,  the  Matched  Bartlett  Processor,  and  the  Adaptive 
Minmax  Processor.  In  all  cases,  the  actual  surface  sound  speed  is  1431  meters /second, 
the  SNR  is  20  dB,  and  the  background  noise  to  sensor  noise  ratio  is  10  dB.  Therefore, 
the  signal  to  background  noise  SNR  is  10  dB.  The  Mismatched  MVDR  Processor 
operates  with  an  assumed  surface  sound  speed  of  1432  meters/second  resulting  in  a 

1  meter/second  mismatch  in  surface  sound  speed.  The  first  figure  for  each  processor 
shows  the  ambiguity  function  evaluated  on  a  course  grid  (45  meters  vertical  spacing, 

2  km  horizontal  spacing)  over  the  range  of  10  to  1000  meters  depth  and  150  to  300  km 
range.  The  second  figure  for  each  processor  shows  the  ambiguity  function  evaluated 
on  a  much  finer  grid  (5  meters  vertical  spacing,  250  meters  horizontal  spacing)  over 
the  range  of  140  to  240  meters  depth  and  241  to  259  km  range. 

A  common  feature  of  all  of  the  ambiguity  functions  is  the  range-extended  band 
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of  elevated  response  in  the  upper  300  meters  of  the  ocecin.  In  other  numerical  ex¬ 
periments  not  detailed  here,  it  is  indicated  that  when  the  source  is  removed,  the 
ambiguity  functions  in  this  upper  layer  of  the  ocean  for  the  Matched  MVDR  and 
Adaptive  Minmax  Processors  drop  approximately  3  to  4  dB  in  the  regions  away  from 
the  source  location  and  drop  approximately  17  dB  at  the  source  location.  The  am¬ 
biguity  functions  for  the  no-source  case  shows  levels  in  the  upper  250  meters  of  the 
ocean  to  be  approximately  4  dB  above  those  in  the  ocean  below  350  meters  depth 
for  both  the  MVDR  and  the  Adaptive  Minmax  Processors.  Therefore,  the  elevated 
response  in  this  region  is  partially  due  to  the  inability  of  the  processors  to  resolve 
the  source  location  in  rzuige  and  partially  due  to  the  presence  of  surface-generated 
background  noise.  The  Matched  Bartlett  Processor  shows  a  comparable  ambiguity 
function  in  the  no-source  case  in  the  upper  layer  of  the  ocean.  In  the  lower  layer  of 
the  ocean  for  the  no-source  case,  the  level  of  the  ambiguity  function  for  the  Bartlett 
Processor  is  only  1  to  2  dB  below  that  in  the  upper  layer,  which  is  consistent  with  the 
inferior  depth  resolution  of  the  Bartlett  Processor  when  compared  to  the  MVDR  and 
the  Adaptive  Minmax  Processors.  When  the  source  is  added,  the  ambiguity  function 
in  the  entire  upper  layer  increases  by  approximately  14  db;  the  ambiguity  function 
in  the  lower  layer  increases  by  approximately  4  dB.  Therefore,  the  high  level  of  the 
ambiguity  function  of  the  Bartlett  Processor  in  the  upper  layer  of  the  ocean  is  due 
primarily  to  the  poor  range  resolution  of  the  Bartlett  Processor  (i.e.,  high  sidelobes). 

The  ambiguity  functions  for  the  Matched  MVDR  Processor  (Figure  3-18),  the 
Matched  Bartlett  Processor  (Figure  3-20),  and  the  Adaptive  Minmax  Processor  (Fig¬ 
ure  3-21)  have  global  maxima  at  the  true  source  location  at  a  normalized  level  of 
approximately  0  dB.  The  Mismatched  MVDR  Processor  (Figure  3-19)  has  a  global 
maximum  at  a  depth  of  185  meters  and  range  250.25  km  at  a  norm2dized  level  of 
-1.15  dB.  The  slight  mismatch  of  1  meter/second  in  surface  sound  speed  results  in  a 
slight  offset  of  the  peak  location  and  a  loss  of  1.15  dB  in  the  peak  response  of  the 
MVDR  Processor.  Consistent  with  the  spatial  resolution  results  in  Subsection  3.2.1, 
the  mainlobe  of  the  Adaptive  Minmax  Processor  is  slightly  broader  than  the  mainlobe 
the  MVDR  Processor  and  considerably  narrower  than  the  mainlobe  of  the  Bartlett 


Processor. 


The  array  gain  defined  in  Subsection  3.1.2  is  a  good  measure  of  a  processor’s  ability 
to  separate  a  desired  signal  from  the  noise  in  which  it  is  embedded.  Figure  3-22  shows 
the  array  gain  of  the  Matched  and  the  Mismatched  MVDR  Processors,  the  Matched 
and  the  Mismatched  Bartlett  Processors,  and  the  Adaptive  Minmax  Processor  as  a 
function  of  SNR  for  several  surface  sound  speeds  when  the  array  focal  point  is  the 
source  location.  In  all  cases,  the  mismatched  processors  operate  with  an  assumed 
stirface  sound  speed  of  1432  meters/second  and  the  background  noise  to  sensor  noise 
ratio  is  10  dB.  In  all  the  figures,  the  independent  variable  is  source  to  background 
noise  SNR  rather  than  the  source  to  sensor  noise  SNR.  The  array  gain  shown  has 
been  normalized  so  that  the  gain  of  the  Matched  Bartlett  Processor  equals  one. 


For  all  SNRs  and  surface  sound  speeds  shown,  the  array  gain  of  the  Adaptive 
Minmax  Processor  is  less  than  the  array  gain  the  Matched  MVDR  Processor  and 
greater  tham  the  array  gaun  of  the  Matched  MVDR  Processor.  As  the  amount  of  the 
mismatch  between  the  surface  sound  speed  assumed  by  the  mismatched  processors 
and  the  actual  surface  sound  speed  increases,  the  gain  of  the  Mismatched  MVDR 
Processor  falls  when  compared  to  the  Matched  MVDR  Processor  and  the  Adaptive 
Minmax  Processor.  As  the  SNR  increases,  a  loss  of  array  gain  in  the  Adaptive  Minmax 


Processor  results  from  the  small  mismatch  between  and  An  interesting  note 
is  that,  for  small  amounts  of  mismatch  between  the  assumed  and  the  actual  surface 


sound  speeds,  the  gain  of  the  Mismatched  Bartlett  Processor  is  greater  than  that 


of  the  Matched  Bartlett  Processor.  This  result  would  be  impossible  in  a  spatially 


white  noise  field  because  the  Matched  Bartlett  and  the  Matched  MVDR  Processors 


maximize  the  array  gain  when  the  noise  field  is  spatially  white.  The  equivalence  of 
the  Matched  Bartlett  and  the  Matched  MVDR  Processors  in  this  case  is  based  upon 
the  assumption  that  S(/)  =  S(/). 

In  Subsection  3.1.3,  it  is  concluded  that,  for  the  case  where  the  array  focal  point 


is  the  source  location,  the  mismatch  between  and  will  decrease  as  the  SNR 
increases  and,  as  a  result,  will  increase  as  the  SNR  increases. 

Figure  3-23,  which  shows  the  generalized  cosine-squared  between  and 
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Figure  3-21:  Ambiguity  Function  for  the  Adaptive  Minmax  Processor 
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Figure  3-22:  Array  Gain  vs  SNR  for  Various  Surface  Sound  Speeds 


95 


function  of  the  source  to  background  noise  SNR  for  various  surface  sound  speeds  when 
the  array  focal  point  is  the  source  location,  supports  this  conclusion.  However,  the  re¬ 
sults  in  Subsection  3.2.1  demonstrated  the  same  dependence  of  Sn(/)~^) 

on  the  SNR  for  the  case  where  S„(/)  =  I.  Therefore,  it  is  unclear  as  to  whether 
the  cause  of  this  dependence  is  the  processor  attempting  to  reduce  |  P  or  it  is 

fl**  Sn(/)~‘  9 

the  processor  attempting  to  reduce  The  role  of  these  quantities  can 

2.// 

be  seen  in  (3.6). 

The  Adaptive  Minmax  Processor  adjusts  by  adjusting  the  least  favorable 
pmf  actual  surface  sound  speed  changes,  it  is  reasonable  to  expect 

that  when  the  array  is  focused  at  the  source  location,  will  change  to  exhibit  a 
peak  in  the  neighborhood  of  the  actual  surface  sound  speed.  Numerical  simulations 
were  run  to  test  whether  or  not  the  processor  exhibits  this  behavior.  In  these  tests, 
there  is  no  propagating  background  noise,  the  signal  to  sensor  noise  SNR  is  10  dB, 
and  the  source  location  and  array  focal  point  are  both  at  a  range  of  250  km  and  a 


depth  of  190  meters.  The  environmental  parameter  set  $  is  sampled  at  seven  points 
(1430.75  meters/second  through  1432.25  meters/second  in  0.25  meters/second  incre¬ 
ments)  and  the  actual  surface  sound  speed  is  varied  between  1430.75  meters/second 
and  1432.25  meters/second  in  0.125  meters/second  increments.  As  shown  in  Fig¬ 
ures  3-24  and  3-25,  the  Adaptive  Minmax  Processor  exhibits  the  expected  behavior 
while  the  actual  surface  sound  speed  is  in  the  lower  part  of  this  range.  However,  when 
the  actual  surface  sound  speed  is  in  the  range  between  1431.75  meters/second  and 
1432.0  meters/second,  does  not  peak  in  the  neighborhood  of  the  actual  surface 
sound  speed.  However,  the  results  presented  in  Figure  3-26  and  discussed  in  the  fol¬ 
lowing  paragraph  indicate  that  this  failure  of  to  peak  as  expected  for  some  values 
of  the  actual  surface  sound  speed  does  not  adversely  affect  the  performance  of  the 


Adaptive  Minmax  Processor. 

In  Subsection  3.1.4  the  effect  on  the  performance  of  the  Adaptive  Minmax  Pro¬ 
cessor  of  the  range  and  sampling  density  of  the  environmental  parameter  set  $  is 
qualitatively  analyzed.  A  set  of  simulations  were  run  with  different  ranges  and  sam¬ 
pling  densities  to  quantitatively  evaluate  the  effect.  The  test  conditions  are  a  signal  to 
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Figure  3-23:  vs  SNR  for  Various  Surface  Sound  Speeds 
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Figure  3-24:  Least  Favorable  PMF  for  Various  Actual  Surface  Sound  Speeds 
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surface  noise  SNR  of  10  dB  and  a  signal  to  sensor  noise  SNR  of  20  dB.  A  single  source 
is  located  at  a  range  of  250  km  and  a  depth  of  190  meters.  Tests  were  run  for  actual 
surface  sound  speeds  of  1431.5  meters/second,  1431.625  meters/second,  1431.75  me¬ 
ters/second,  1431.875  meters/second,  1431.9375  meters/second,  1432.0  meters/second, 
1432.125  meters/second,  and  1432.25  meters/second  and  for  an  array  focal  point 
of  a  range  of  250  km  and  a  depth  of  190  meters  (i.e.,  the  source  location).  The 
peak  response  loss,  which  is  defined  as  the  estimated  average  signal  power  generated 
by  the  Adaptive  Minmax  Processor  minus  that  generated  by  the  Matched  MVDR 
Processor,  is  plotted  versus  actual  surface  sound  speed  in  Figure  3-26.  The  three 
cases  shown  are  for  the  sampling  of  $  at  two  points  (1431.0  meters/second  and 
1432.0  meters/second),  three  points  (1431.0  meters/second,  1431.5  meters/second, 
and  1432.0  meters/second),  and  seven  points  (1430.75  meters/second  through  1432.25  me¬ 
ters/second  in  0.25  meters/second  increments).  The  Adaptive  Minmax  Processor 
shows  almost  no  loss  in  response  relative  to  the  MVDR  Processor  for  all  cases  when 
the  actual  surface  sound  speed  falls  within  the  interval  spanned  by  the  samples  of 
the  surface  sound  speed  parameter  set  This  indicates  that  for  this  class  of  envi¬ 
ronmental  uncertainties,  the  performance  of  the  Adaptive  Minmaoc  Processor  is  fairly 
insensitive  to  changes  in  the  szunpling  density  of  $.  However,  when  the  actual  sur¬ 
face  sound  speed  falls  outside  the  interval  spanned  by  the  the  samples  of  the  surface 
sound  speed  parameter  set  $  (i.e.,  ^(0)  >  1432.0  meters/second  for  the  two  and  three 
sample  cases),  the  Adaptive  Minmax  Processor  shows  a  dramatic  loss  in  performance 
as  measured  by  the  peak  response  loss.  For  this  same  range,  the  Adaptive  Minmax 
Processor  utilizing  seven  sample  of  9  shows  no  loss  in  performance.  Therefore,  as 
predicted  in  Subsection  3.1.4,  the  Adaptive  Minmax  Processof  suffers  a  significant 
degradation  in  performance  when  the  actual  environmental  conditions  which  are  en¬ 
countered  fall  outside  the  range  of  the  sampled  environmental  parameter  set  $. 

The  norm-squared  of  the  array  weight  vector  of  a  linear  weight-and-sum  beam- 
former  is  often  used  as  a  measure  of  the  sensitivity  of  the  processor  to  environmental 
mismatch  [8].  However,  the  use  of  this  measure  depends  upon  the  processor  using 
a  replica  vector  which  is  calculated  a  priori  rather  than  adaptively  as  is  done  in  the 


Figure  3-26:  Peak  Response  Loss  for  Various  Numbers  of  Environmental  Samples 

Two-Stage  MVDR  Processor  interpretation  of  the  Adaptive  Minmax  Processor. 

A  set  of  tests  were  run  to  measure  the  norm-squared  of  the  weight  vectors  of  the 
Matched  MVDR  and  Adaptive  Minmax  Processors  under  varous  conditions.  In  all 
cases,  the  signal  to  surface  noise  SNR  is  10  db,  the  signal  to  sensor  noise  SNR  is 
20  db,  the  source  is  at  a  range  of  250  km  and  a  depth  of  190  meters,  and  the  set  $ 
is  sampled  at  seven  points  as  described  in  the  preceeding  paragraph.  Tables  3.1  and 
3.2  list  the  norm-squared  of  the  array  weight  vectors  for  the  MVDR  and  Adaptive 
Minmax  Processors  when  the  range  of  the  array  focal  point  is  250  km  and  the  focal 
point  depth  and  actual  surface  sound  speed  take  on  several  values.  Note  that  the 
norm-squared  of  the  weight  vectors  for  the  two  processors  are  comparable  under 
equivalent  conditions.  The  large  norm-squared  when  the  depth  of  the  array  focal 
point  is  165  meters  can  be  attributed  to  the  processors  attempting  to  null  out  the 
source  at  190  meters  depth  while  maintaining  a  reasonable  response  (equal  to  one 
in  the  MVDR  case)  to  a  signal  emitted  by  a  source  at  165  meters  depth.  Under 
the  standard  criterion,  these  weight  vector  norms  would  indicate  that  the  MVDR 
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focal  depth 

140  meters 

165  meters 

190  meters 

c(0)  =  1430.75  m/s 

11.41 

24.67 

6.11 

c(0)  =  1431.375  m/s 

13.19 

36.61 

4.49 

c(0)  =  1431.9375  m/s 

11.37 

30.39 

1.18 

c(0)  =  1432.00  m/s 

18.93 

39.15 

3.13 

Table  3.1;  Norm-Squared  of  Matched  MVDR  Processor  Weight  Vectors 


focal  depth 

140  meters 

165  meters 

190  meters 

c(0)  =  1430.75  m/s 

12.19 

24.66 

5.70 

c(0)  =  1431.375  m/s 

14.13 

35.64 

1.22 

c(0)  =  1431.9375  m/s 

11.03 

30.55 

1.24 

c(0)  =  1432.00  m/s 

20.29 

39.13 

1.12 

Table  3.2:  Norm-Squared  of  Adaptive  Minmax  Processor  Weight  Vectors 

and  Adaptive  Minmax  Processors  are  equally  sensitive  to  environmental  mismatch. 
This  is  consistent  with  the  loss  in  performance  by  the  Adaptive  Minm<ix  Processor 
when  the  actual  environmental  condition  encountered  falls  outside  the  range  of 
However,  the  results  presented  in  this  chapter  indicate  that  when  the  environmental 
conditions  fall  within  the  range  of  the  Adaptive  Minmax  Processor  does  not  suffer 
a  degradation  in  performance  as  the  environmental  conditions  change.  Therefore,  the 
norm-squared  of  the  array  weight  vector  is  an  inappropriate  measure  of  the  capability 
of  the  Adaptive  Minmax  Processor  to  adjust  to  changing  environmental  conditions 
within  the  range  of  but  is  a  good  measure  of  the  processor’s  loss  of  performance 
when  the  environmental  conditions  fall  outside  the  range  of 


3.2.3  The  Random  Ideal  Waveguide 

This  final  set  of  numerical  results  were  generated  using  an  ideal  waveguide  model 
of  the  ocean  and  a  normal  mode  representation  of  the  propagating  signals.  The 
ocean  is  assumed  to  be  a  randomly  time-variant  medium  with  the  salient  parameters 
of  the  simulation  being  the  same  as  in  Subsection  3.2.1.  That  is,  the  deterministic 
component  of  the  ocean  is  assumed  to  be  an  isovelocity  waveguide  with  a  sound  speed 
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of  77  =  1500  m/s  and  an  unknown  but  constant  depth  H  between  290  meters  and 
310  meters.  The  sea  surface  is  assumed  to  be  a  free  surface  and  the  ocean  bottom  is 
assumed  to  be  infinitely  rigid.  The  source  is  assumed  to  be  a  deterministic  complex 
exponential  with  a  frequency  of  20  Hz.  In  addition,  the  source  is  assumed  to  be 
sufficiently  far  from  the  array  so  that  the  horizontal  propagation  of  the  modes  can  be 
modeled  by  a  complex  exponential  rather  than  a  Hankel  function. 

The  receiving  array  is  also  the  same  as  in  Subsection  3.2.1.  It  is  a  nine-element 
vertical  array  with  the  top  element  at  30  meters  depth,  the  bottom  element  at  270 
meters  depth,  and  an  inter-element  spacing  of  30  meters.  In  all  cases  here,  there  is  a 
single  source  at  a  depth  of  150  meters  and  at  a  range  of  50  km  from  the  array.  There 
is  assumed  to  be  no  propagating  background  noise  and  the  localization  problem  is 
assumed  to  be  a  one-dimensional  (depth  only)  problem. 

As  before,  the  deterministic  component  of  the  waveguide  supports  eight  propa¬ 
gating  modes.  The  mode  shapes  are  a  function  of  depth  z  and  given  by 

<f)n{z)  =  sin(kv„z)  n  =  1,...,8. 

From  (4.10)  the  deterministic  component  of  the  mode  of  the  desired  signal  received 
at  the  array  sensor  is,  to  within  a  scale  factor,  given  by 


where  R  is  the  horizontal  range  from  the  source  to  the  array  sensors  and  Zj  is  the 
depth  of  the  array  sensor. 

The  second  order  characterization  of  the  reduced  wavefunctions  V’n  used  here  dif¬ 
fers  slightly  from  that  given  in  Section  4.3.  The  key  difference  is  that,  in  Section  4.3, 
the  modal  phases  are  assumed  to  be  incoherent  between  modes.  Here,  it  is  assumed 
that  the  phase  of  the  reduced  wavefunction  of  the  mode  at  a  range  R  is  given 
by  L^n{RyQ)  =  Bn{R)  where  9n(R)  is  a  zero-mean  Gaussian  random  variable  with 
E[5n(^)  ^m(^)]  =  j  6nm-  P  IS  the  modal  phase  decorrelation  range.  Then,  using 
the  identity  E[c''*]  =  c~^  for  ^  a  zero- mean  Gaussian  random  variable  with  vari- 
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ance  a^,  the  cross-correlation  function  of  the  reduced  wavefunction  required  for  the 
computation  of  the  replica  vector  is  assumed  to  be  given  by 


E[UR,0)rM0)]  = 


f  sin(Arv’^Z^o^yee)  \  /  sin(^v^ZjoMree)  \ 

\|  8in(*:v„  ^source  )  \)  VisMfcv'*  ^Bouree)  \) 


Wn{R)  is  the  power  in  the  n**  mode  at  range  R  and  evolves  according  to  the  differ¬ 
ential  equation  in  (4.20). 

With  these  assumptions  and  letting 


(W^iR)\^  (  sm{kv^z,^rc.)  \ 
\  R  ^Hn  )  \  sin(fcv’n^*ource)  \j 


e“-'*""^sin(^v„^0> 


the  ensemble  cross-spectral  correlation  matrix  of  the  desired  signal  is  given  by 

s,„(/)  =  Eix,(/)jr;(/)]  =  SE  v.(fl,n)u;(A,m)  £-«"-*■">  (3.7) 

n=l  m=l 


The  element  of  the  signal  replica  vector  is,  prior  to  normalization,  equal  to  S,.*(/) 
evaluated  at  the  appropriate  set  of  environmental  conditions.  For  the  case  where 
R'>  0,  the  modal  phases  are  incoherent,  c”^  «  0,  and  (3.7)  reduces  to 


Sx.,(/)  =  53  u,(ft,n)  v;(/Z,n)  =  s*"(*V'„^i)sin(fcv„cj)W^7.(^), 

n=l  nsl 

which,  when  evaluated  at  /  =  /;  yields  (4.22).  For  the  case  where  R  ^  0,  the  modal 
phases  are  perfectly  coherent,  c~^  «  1,  and  (3.7)  reduces  to 

s.„(/)  =  (e  v,{R,n)j  (e  -"(^.n))  • 

The  performance  of  the  MVDR,  the  Bartlett,  and  the  Adaptive  Minmax  Pro¬ 
cessors  will  be  shown  to  suffer  when  modal  phase  coherence  is  reduced.  There  are 
several  reasons  for  the  performance  loss.  The  first  is  that,  as  modal  phase  coherence 
is  reduced,  the  correlation  among  the  desired  signal  as  received  at  each  of  the  array 
sensors  is  reduced.  As  a  result,  the  cross-spectral  correlation  matrix  of  the  desired 
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signal  will  begin  to  resemble  the  cross-spectral  correlation  matrix  of  the  propagating 
background  noise  ^n  the  limiting  case  of  a  total  loss  of  correlation  among  the  desired 
signal  as  received  at  each  of  the  array  sensors,  the  cross-spectral  correlation  matrix 
of  the  desired  signal  will  be  the  same  as  the  cross-spectral  correlation  matrix  of  sen¬ 
sor  noise.  In  this  limiting  case  (assuming  that  the  noise  field  consists  of  only  sensor 
noise),  the  array  gain 


G(m)  = 


Sxif)  m 
is!"  Snif)  m 


(3.8) 


will  be  one  for  all  array  weight  vectors  (i.e.,  the  array  processing  yields  no  improve¬ 
ment  in  SNR).  In  (3.8),  the  cross-spectral  correlation  matricies  are  normalized  to 
have  traces  equal  to  one.  In  the  more  general  case,  as  signal  coherence  is  lost  and 
Sxif)  becomes  more  like  Snif),  maximum  array  gain  achievable  by  any  linear 
array  processor  will  be  reduced.  This  is  a  fundamental  limitation  on  the  performance 
of  any  array  processor  which  utilizes  a  linear  filter  to  achieve  spatial  discrimination. 

The  second  cause  of  performance  loss  is  particular  to  linear  processors  using  the 
same  signal  model  as  that  used  by  the  MVDR,  the  Bartlett,  and  the  Adaptive  Min- 
max  Processors.  For  the  case  where  R  <.  0,  the  cross-spectral  correlation  matrix  of 
the  desired  signal  and  the  signal  subspace  will  be  rank  one.  In  addition,  the  normal¬ 
ized  signal  replica  vector  will  be  the  eigenvector  which  corresponds  to  the  non-zero 
eigenvalue  of  Si(/).  However,  for  the  case  where  0,  the  cross-spectral  correlation 
matrix  and  the  signal  subspace  can  have  a  rank  of  up  to  M.  The  MVDR,  the  Bartlett, 
and  the  Adaptive  Minmax  Processors  all  model  the  signal  subspace  as  having  rank 
one  and  attempt  to  pass  only  those  signals  falling  within  that  subspace.  Thus,  at 
ranges  where  modal  coherence  is  reduced,  and  more  generally  for  situations  where 
signal  coherence  across  the  array  is  rvxluced,  these  processors  can  miss  a  significant 
portion  of  the  power  in  the  desired  signal.  In  this  case,  the  MVDR  and  the  Adap¬ 
tive  Minmax  Processors  will  attempt  to  adapt  and  filter  out  a  significant  portion  of 
the  desired  signal.  This  problem  is  a  model  mismatch  problem  since  it  results  in  a 
mismatch  between  the  assumed  and  the  actual  second-order  statistics  of  the  desired 
signal. 
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To  illustrate  the  performzuice  degr2idatioD  of  the  processors  as  modal  coherence 
is  lost,  several  tests  were  run  with  different  modal  phase  decorrelation  ranges  and 
without  modal  coupling.  That  is, 


f- 


sin  (  2«oiiree  ) 


sin(A:v„  ^source)  \ 


j  =  si 


sin(fcv„ 


While  this  assumption  of  no  modal  coupling  in  a  random  waveguide  is  unrealistic, 
it  is  made  in  order  to  isolate  the  cause  of  the  performance  degradation  as  the  loss 
of  modal  phase  coherence.  The  introduction  of  modal  coupling  can  be  expected 
to  further  degrade  the  depth  resolution  of  the  processors  because  the  source  depth 
information  is  carried  in  the  relative  amplitudes  of  the  modal  excitations  [36];  and 
when  modal  coupling  is  present,  the  modal  powers  will  tend  to  an  equilibrium  state 
which  may  not  depend  on  the  initial  levels  of  modal  excitation.  For  example,  in  the 
case  where  all  of  the  modal  coupling  parzuneters  Cnm  defined  in  (4.21)  equal  one, 
the  equilibium  state  of  the  modal  powers  is  Wn{R)  =  Wm{R)  for  all  n  and  m.  In 
this  case,  the  processor  will  not  have  depth  resolution  once  the  equilibrium  state  is 
reached. 

As  a  measure  of  the  extent  to  which  the  modal  phase  decorrelation  affects  the 
distribution  of  signal  power  among  the  eigenvectors  of  the  cross-spectral  correlation 
matrix  of  the  desired  signal.  Figure  3-27  shows  the  relationship  between  the  maximum 
eigenvalue  of  the  normalized  cross-spectral  correlation  matrix  expressed  in  dB  (i.e., 
10  *  logio(‘^mox))  and  the  extent  of  the  modal  phase  decorrelation  as  measured  by 
c~^.  c~^  =  1  indicates  perfectly  correlated  modal  phases  and  c~^  =0  indicates 
uncorrelated  modal  phases.  The  cross-spectral  correlation  matrix  is  normalized  so 
that  the  sum  of  the  eigenvalues  equals  one.  This  maximum  eigenvalue  is  the  maximum 
signal  power  which  exists  in  any  rank  one  subspace.  As  the  maximum  eigenvalue 
decreases,  the  signal  power  lost  by  any  linear  processor  which  assumes  a  rank  one 
signal  subspace  will  increase. 

Figures  3-28  through  3-31  show  the  ambiguity  functions  for  the  MVDR,  the 
Bartlett,  and  the  Adaptive  Minmax  Processors  for  the  cases  where  0  =  oo,  2/2, 


Figure  3-27:  Maximum  Cross-Spectral  Correlation  Matrix  Eigenvalues  vs  Modal 
Phase  Decorrelation 

R,  and  0  (c”^  =  1,  .607,  .368,  and  0,  respectively).  ^  =  oo  corresponds  to  the 
deterministic  ideal  waveguide,  the  analysis  of  which  is  presented  in  Subsection  3.2.1. 
^  =  0  corresponds  to  the  modal  phase  decorrelation  model  2issumed  in  [32].  All 
of  the  processors  ua^;  the  correct  values  for  The  ocean  depth  in  all  cases  is  310 
meters.  In  the  figures,  the  terms  “Matched”  and  “Mismatched”  refer  to  the  assumed 
ocean  depth  used  by  the  processors  in  calculating  the  signal  rephca  vectors.  The 
“Matched”  processors  use  the  correct  value  of  the  ocean  depth.  The  “Mismatched” 
processors  use  an  assumed  ocean  depth  of  290  meters.  The  localization  performance 
of  the  Matched  MVDR  Processor,  the  Matched  Bartlett  Processor,  and  the  Adaptive 
Minmax  Processor  shows  a  significant  deterioration  as  the  modal  phase  correlation 
decreases 

The  evolution  of  the  ambiguity  functions  of  the  Matched  MVDR,  the  Matched 
Bartlett,  and  the  Adaptive  Minmax  Processors  as  /?  -+  0  is  characteristic  of  what 
could  be  expected  with  increasing  model  mismatch.  The  model  mismatch  is  not 
caused  by  an  incorrect  assumed  value  of  the  ocean  depth  or  of  but  by  an  incorrect 
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implicit  model  of  the  signal  subspace  as  having  rank  one.  The  ambiguity  functions 
of  these  processors  in  Figures  3-29  through  3-31  show  reduced  peak  levels  when  com¬ 
pared  to  those  shown  in  Figure  3-28  and  show  peak  locations  which  are  slightly  offset 
from  the  true  source  depth  of  150  meters. 

The  model  mismatch  problems  which  the  MVDR,  the  Bartlett,  and  the  Adaptive 
Minmax  Processors  have  in  the  random  waveguide  motivates  a  modification  to  the 
definition  of  the  replica  vector  which  slightly  improves  the  performance  of  the  proces¬ 
sors  in  random  media.  The  modification  is  motivated  by  the  observation  that  the  rank 
one  signal  subspace  containing  the  greatest  amount  of  signal  power  is  that  spanned 
by  the  eigenvector  of  the  cross-spectral  correlation  matrix  of  the  desired  signal  which 
corresponds  to  the  largest  eigenvalue.  Thus,  the  replica  vector  is  redefined  to  be  the 
eigenvector  which  corresponds  to  the  largest  eigenvalue.  The  magnitude  normaliza¬ 
tion  convention  applied  to  the  replica  vector  is  that  it  has  a  norm  of  one,  and  the 
phase  normalization  convention  is  that  the  element  corresponding  to  the  reference 
sensor  is  real  and  non-negative.  In  the  case  where  the  signal  subspace  has  rank  one, 
this  definition  of  the  replica  vector  is  equivalentto  the  definition  of  the  replica  vector 
contained  in  Subsection  3.1.3. 

Figures  3-32  through  3-35  show  the  ambiguity  functions  generated  by  processors 
using  this  new  definition  of  the  replica  vector  for  test  conditions  which  are  identical  to 
those  used  to  generate  Figures  3-32  through  3-35.  As  was  the  case  with  the  previous 
set  of  results  generated  using  the  definition  of  the  replica  vector  from  Subsection  3.1.3, 
the  ambiguity  function  of  the  Adaptive  Minmax  Processor  shows  characteristics  which 
are  superior  to  either  of  the  mismatched  processors.  For  the  Matched  MVDR  Proces¬ 
sor,  the  Matched  Bartlett  Processor,  and  the  Adaptive  Minmax  Processor,  the  peak 
levels  of  the  ambiguity  functions  are  greater  than  those  in  the  previous  set  of  results. 
With  the  exception  of  the  case  of  the  Adaptive  Minmax  Processor  for  =  0,  the 
peaks  of  the  ambiguity  functions  occur  at  the  true  source  depth.  The  peak  in  the 
ambiguity  function  of  the  Adaptive  Minmax  Processor  for  the  case  where  ^  =  0 
occurs  at  a  depth  of  147.5  meters  compared  with  the  true  source  depth  of  150  meters. 
These  results  indicate  a  reduction  in  the  “mismatch”  caused  by  the  implicit  rank  one 
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(a)  Matched  MVDR  Processor 


(b)  Mismatched  MVDR  Processor 


0  so  100  ISO  200  2$0  900  0  SO  100  190  200  290  SOC 

DtPTN  (MM)  DIPTN 


(c)  Matched  Bartlett  Processor  (d)  Mismatched  Bartlett  Processor 


0  so  100  190  200  200  SOO 

Dtmi 


(e)  Adaptive  Minmax  Processor 


Figure  3-28:  Ambiguity  Functions  for  =  oo 
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(a)  Matched  MVDR  Processor 


(b)  Mismatched  MVDR  Processor 


(c)  Matched  Bartlett  Processor 


(d)  Mismatched  Bartlett  Processor 


(e)  Adaptive  Minmax  Processor 


Figure  3-29:  Ambiguity  Functions  for  0  =  2R 

no 


eo-«B» 


Dim  ^m)  BIPTM 


(a)  Matched  MVDR  Processor  (b)  Mismatched  MVDR  Processor 


(c)  Matched  Bartiett  Processor  (d)  Mismatched  Bartlett  Processor 


Mm  Ima) 


(e)  Adaptive  Minmax  Processor 


Figure  3-30:  Ambiguity  Functions  (or  ^  =  R 
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(a)  Matched  MVDR  Piocesaor  (b)  Mismatched  MVDR  Processor 


orrrir  ptmt 

(c)  Matched  Bartlett  Processor  (d)  Mismatched  Bartlett  Processor 


0  SO  100  190  200  no  9Q0 

MfTII 


(e)  Adaptive  Minmax  Processor 


Figure  3-31:  Ambiguity  Functions  for  ^  «  0 
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signal  subspace  assumption. 

The  peaks  of  the  ambiguity  functions  of  the  Matched  MVDR  and  Bartlett  Pro¬ 
cessors  for  the  cases  where  0  <  oo,  when  compared  to  the  peak  value  for  the  case 
where  =  oo,  show  a  loss  which  is,  to  within  0.08  dB,  equal  to  the  reduction  in  the 
magnitude  of  the  maximum  eigenvalue  of  the  normalized  cross-spectral  correlation 
matrix  for  the  appropriate  values  of  /3  (Figure  3-27).  This  indicates  that  these  pro¬ 
cessors  are  correctly  estimating  the  power  contained  in  the  rank  one  signal  subspace 
spanned  by  the  replica  vector.  However,  the  peak  value  losses  for  the  Adaptive  Min- 
max  Processor  are  greater  than  those  which  would  be  predicted  by  the  reduction  in 
the  maximum  eigenvalue  of  the  cross-spectral  correlation  matrix.  This  increased  loss 
can  be  explained  using  the  results  from  Subsection  3.2.1  for  the  deterministic  ideal 
waveguide.  Figure  3-16  shows  the  source  depth/oce£m  depth  response  evaluated  at 
the  true  source  depth  and  ocean  depth  of  the  weights  of  the  Adaptive  Minmaix  Pro¬ 
cessor  when  the  focal  point  is  the  source  depth.  This  response  drops  in  magnitude 
as  the  SNR  decreases.  In  this  case,  the  decrease  in  the  power  in  the  rank  one  signal 
subspace  spanned  by  the  eigenvector  corresponding  to  the  maximum  eigenvalue  and 
the  corresponding  increase  in  the  power  in  the  other  portions  of  the  signal  subspace  is 
interpreted  by  the  processor  as  a  decrease  in  SNR.  Therefore,  the  magnitude  response 
of  the  processor  at  the  source  location  when  the  processor  is  focused  on  the  source 
location  drops.  The  peak  in  the  ambiguity  function  will  exhibit  a  corresponding  drop. 

Sidelobe  suppression  remains  a  significant  problem  in  all  of  the  processors.  This 
indicates  that  the  eigenvector  corresponding  to  the  maximum  eigenvalue  of  the  cross- 
spectral  correlation  matrix  is  an  ambiguous  characterization  of  the  spatial  structure 
of  the  desired  signal  as  a  function  of  source  depth  for  cases  where  the  signed  subspace 
has  a  rank  greater  than  one.  Future  work  on  this  problem  is  discussed  in  Chapter  5. 


3.3  Algorithm  Complexity 

The  solution  of  the  minmax  problem  in  Step  1  of  the  Adaptive  Minmax  Matched 
Field  Processor  is  by  far  the  most  computationally  complex  of  the  tasks  which  must 
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eiPTM  Dtmi  tetM) 


(a)  Matched  MVDR  Processor  (b)  Mismatched  MVDR  Processor 


DtPTN  Cb*«i«)  depth  Imw) 


(c)  Matched  Bartlett  Processor 


(d)  Mismatched  Bartlett  Processor 


(e)  Adaptive  Minmax  Processor 


Figure  3-32:  Ambiguity  Functions  for  ^  =  oo  with  redefined  Replica  Vector 
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(c)  Matched  Bartlett  Processor 


(d)  Mismatched  Bartlett  Processor 


0  90  too  900  290  900 

•tmi 


(e)  Adaptive  Minmax  Processor 


Figure  3-33:  Ambiguity  Functions  for  ^  —  2R  with  redefined  Replica  Vector 
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(a)  Matched  MVDR  Processor  (b)  Mismatched  MVDR  Processor 


(c)  Matched  Bartlett  Processor  (d)  Mismatched  Bartlett  Processor 


(e)  Adaptive  Minmax  Processor 


Figure  3-34:  Ambiguity  Functions  for  ^  =  R  with  redefined  Replica  Vector 
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(a)  Matched  MVDR  Processor  (b)  Mismatched  MVDR  Processor 
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(c)  Matched  Bartlett  Processor  (d)  Mismatched  Bartlett  Processor 
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(e)  Adaptive  Minmax  Processor 


Figure  3-35:  Ambiguity  Functions  for  w  0  with  redefined  Replica  Vector 
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K 

11 

22 

44 

min 

2 

3 

4 

5 

max 

18 

32 

38 

46 

ave 

8.12 

13.62 

17.56 

20.49 

Table  3.3:  Iterations  of  Complementary  Pivot  Algorithm  vs  K 

be  performed.  From  Section  2.3,  this  problem  is  solved  by  finding  the  nonnegative 
vectors  £  and  £  contained  in  (where  K  is  the  number  of  points  in  the  sampled 
environmental  parameter  set  $)  and  the  real  nonnegative  real  scalar  z  which  satisfy 

minzo  s.t. 

^0,  L  E  >  0  , 

=  0  >  and 

U  -  Mif,z)i-izo  =  rif,i)  +  l 

During  numerical  tests  of  the  Adaptive  Minmax  Matched  Field  Processor  using  the 
deterministic  and  random  ideal  waveguide  acoustic  propagation  models,  the  number 
of  iterations  required  to  solve  the  minmax  problem  were  tabulated.  These  tests  were 
not  Monte  Carlo  simulations.  The  tests  were  conducted  for  iif  =  11,  22,  33,  and  44. 
The  results  in  Table  3.3  show  the  minimum,  the  maximum,  and  the  average  number 
of  iterations  required  to  find  a  solution  to  the  minmax  problem.  These  results  hce 
based  on  4830  trials. 

The  upper  bound  on  the  number  of  required  iterations  can  be  explicitely  calcu¬ 
lated.  By  construction  of  the  complementary  pivot  algorithm,  only  one  variable  from 
each  complementary  piur  can  be  in  the  basis  at  any  given  time.  With  the  exception 
of  the  termination  of  the  algorithm  when  the  variable  z  leaves  the  basis,  there  will 
be  one  and  only  one  complementary  pair  which  has  neither  variable  in  the  basis.  At 
termination,  each  complementary  pair  will  have  one  variable  in  the  basis  and  one  vari¬ 
able  outside  the  basis.  Therefore,  since  there  are  K  complementary  pairs  of  variables, 
there  are  possible  non-terminating  bases. 
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From  Section  2.3,  two  of  the  essential  points  of  the  convergence  proof  for  the 
complementary  pivot  algorithm  [26]  are  that,  at  each  iteration  the  basis  changes  and 
no  basis  can  be  visited  by  the  algorithm  more  than  once.  Therefore,  since  there  are 
only  possible  non-terminating  bases,  this  number  is  also  an  upper  bound  on 

the  number  of  iterations  required  to  find  a  solution. 

This  number  shows  an  exponential  growth  in  complexity  with  the  number  of 
environmental  states.  This  is  an  upper  bound  on  the  number  of  iterations,  and  does 
not  accurately  reflect  the  actual  complexity  of  the  algorithm.  Treating  M(/,z)  and 
r(/,  £)  as  random  variables  with  some  restrictions  on  their  distributions,  the  results 
presented  in  Section  2.5  of  [27]  place  an  upper  bound  on  the  mean  number  of  iterations 
required  to  find  a  solution.  This  upper  bound  is  This  indicates  O  (K^)  growth 

in  the  expected  number  of  iterations  required  to  find  a  solution.  When  compared  to 
results  in  Table  3.3,  the  upper  bounds  on  both  the  maximum  number  of  iterations 
and  the  mean  number  of  iterations  appear  to  be  very  conservative. 

Finally,  the  results  shown  above  use  the  required  number  of  iterations  to  find  a 
solution  as  a  measure  of  the  complexity  of  the  complementary  pivot  algorithm.  Of 
equal  importance  is  the  amount  of  computation  required  for  each  iteration.  The  bulk 
of  the  computation  in  each  iteration  of  the  algorithm  involves  computing  K  ratios  to 
determine  which  variable  will  leave  the  basis  at  that  iteration,  and  implementing  the 
Gaussian  elimination  which  adds  one  variable  to  the  basis  and  removes  another  vari¬ 
able  from  the  basis  which  requires  K  multiplications  and  2K'*  additions.  Therefore, 
each  iteration  requires  approximately  2K^  additions  and  2K  multiplications. 
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Chapter  4 


Matched  Field  Calculation  of  the 
Signal  Replica  Vector 


Matched  field  calculation  of  the  signal  replica  vector  requires  that  the  cross-correlation 
functions  of  the  signals  received  at  each  of  the  array  sensors  be  calculated  taking  into 
account  the  propagation  characteristics  of  the  ocean  environment.  These  characteris¬ 
tics  are  parameterized  by  the  statistics  of  the  temporally  and  spatially  varying  sound 
speed  structure  of  the  ocean  C(z,t).  This  chapter  does  not  present  original  work 
on  the  development  of  methods  to  calculate  the  required  cross-correlation  functions 
given  the  statistics  of  C{z,t),  Rather,  it  explains  how  the  theory  and  the  methods 
developed  by  others  can  be  applied  to  the  calculation  of  the  required  cross-correlation 
functions. 

In  general,  for  each  region  in  the  oceans,  season,  and  time  of  day,  it  is  fairly 
accurate  to  model  C{i,t)  as  the  sum  of  a  time-invariant  equilibrium  component  and 
a  time-varying  stochastic  component.  The  equilibrium  component  is  a  function  of 
only  the  depth.  The  stochastic  component  is  a  function  of  the  three-dimensional 
location  and  the  time  [30,  32].  That  is, 

C(i,<)  =  C„(l  +  Uo{z)  + 

where  z  denotes  depth,  £  denotes  three-dimensional  location,  and  Co  is  a  reference 
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sound  speed.  V{z)  =  (7o(l  +  ^o(^))  is  the  equilibrium  (or  deterministic)  component 
of  the  sound  speed  structure.  This  component  is  dominated  by  the  seasonally  and 
geographically  varying  temperature  and  salinity  profiles  of  the  ocean  and  the  constant 
pressure  gradient  of  the  oceans.  Con(z,i)  is  a  zero-mean  stochastic  process  capturing 
the  random  component  of  the  sound  speed  structure.  In  the  open  oceans,  CoH{z,  t) 
is  dominated  by  the  internal  waves  propagating  in  the  oceans  [30]  and  the  tides  [31]. 
It  is  assumed  that  fi(z,  <)  is  a  temporally  wide-sense  stationary  process. 

The  relationship  between  the  signal  replica  vector  and  the  spatial/temporal  cross¬ 
correlation  function  of  the  desired  signal  is  derived  in  Section  4.1.  There  are  a  number 
of  methods  available  for  calculating  the  spatial/ temporal  correlation  function  of  the 
desired  signal  given  ^(2)  and  the  second-order  statistics  of  C'o/i(2,<).  One  of  these 
methods  utilizes  the  ray  approximation  to  the  solution  of  the  wave  equation  and  is 
described  in  Section  4.2.  Another  method  uses  the  normal  mode  approximation  to 
the  solution  of  the  wave  equation  and  is  described  in  Section  4.3.  A  third  method, 
which  is  to  numerically  solve  the  wave  equation  without  approximations,  is  briefly 
described  in  Section  4.4. 


4.1  The  Spatial/Temporal  Cross- Correlation  Function 

For  the  three  definitions  of  the  signal  replica  vector  contained  herein,  the  signal  replica 
vector  depends  on  only  the  conditional  a  priori  cross-correlation  of  the  snapshots  of 
the  desired  signal  as  received  at  different  array  sensors.  That  is, 

where  the  superscript  /  denoting  the  DTFT  segment  number  has  been  dropped.  From 
Section  2.2 

m)  = 

m*0 

and  i^[m,2]  is  a  multi-dimensional  temporally  wide-sense  stationary  random  process. 
Ignoring  the  distortion  to  the  desired  signal  introduced  by  the  continuous- time  anti- 
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aliasing  filter  and  the  temporal  sampling  process,  the  cross-correlation  function  can 
be  expressed  as 

(W-l) 

E[X,(/,i)X:(/,  r)  I  il  =  i:  (M-  I  j  A.,  2)i:(0,i)  |  f ,  (4.1) 

where  Xi(t,z)  is  the  continuous-time  desired  signal  received  at  the  array  sensor. 
Therefore,  the  calculation  of  the  signal  replica  vector  requires  calculating  the  spa¬ 
tial/temporal  correlation  function  of  the  desired  signal  at  the  array  sensors. 

From  Subsection  2.2.1,  the  point  source  at  the  array  focal  point  is  assumed  to 
be  a  wide-sense  stationary  random  process.  Let  E[xi(sAt,z)iJ(0,z)  |  /o]  be  the 

cross-correlation  function  on  the  righthand  side  of  (4.1)  for  the  case  where  the  point 
source  emits  a  deterministic  complex  exponential  Let  Px{f)  be  the  power 

spectral  density  of  the  wide-sense  stationary  point  source.  Then,  assuming  that  the 
ocean  is  a  linear  acoustic  medium,  the  spatial/temporal  cross-correlation  function  in 
(4.1)  for  the  wide-sense  stationary  point  source  is 

E[zi(sAt,i)x;(0,i)  I  ^  =  [  P^{fo)E[xi{sAu2)xl{0,z)\^,fo]dfo. 

Therefore,  (4.1)  can  be  rewritten  as 

,00 

(4.2) 

That  is,  the  cross-correlation  function  of  the  snapshots  of  the  desired  signal  can  be 
calculated  by  integrating  the  individual  contributions  made  by  sources  emitting  de¬ 
terministic  complex  exponentials  over  an  appropriate  range  of  frequencies.  Therefore, 
for  the  remainder  of  this  chapter,  it  will  be  assumed  that  the  signal  emitted  by  the 
point  source  at  the  array  focal  point  is  the  deterministic  complex  exponential 
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and  the  development  will  focus  on  calculating 

(M-l) 

»-(«-!) 

(4.3) 

As  a  practical  matter,  the  frequency  selectivity  of  the  DTFT  makes  it  possible  to 
closely  approximate  (4.2)  by  considering  the  frequencies  /«  in  a  small  neighborhood 
of  /,  the  extent  of  which  is  roughly  inversely  proportional  to  the  segment  length  of 
the  DTFTs  and  A«.  The  dependence  of  E[Xj(/,£)X]f(/,«)  |  ^,/o]  on  the  quantity 
if—fo)  is  explicitly  shown  in  (4.7),  (4.8),  (4.22),  and  (4.24).  These  equations  provide 
some  insight  into  the  extent  of  the  neighborhood  of  /  over  which  it  is  necessary  to 
evaluate  E[Xi(/,z)A’;(/,z)  | 

Given  that  the  deterministic  ocean  component  is  modeled  as  being  horizontally- 
stratified,  the  sound  field  received  at  any  sensor  which  would  have  propagated  through 
the  deterministic  ocean  can  be  assumed  to  have  propagated  completely  within  the 
vertical  plane  containing  both  the  source  and  the  sensor.  Any  location  within  this 
plane  will  be  denoted  by  its  horizontal  range  from  the  source  r  and  its  depth  z. 
The  horizontal  distance  from  the  source  to  the  array  sensor  will  be  denoted  by  Ri. 
Throughout  this  section,  the  dependence  of  the  desired  signal  on  the  source  location  s. 
will  be  assumed  to  be  understood  and  the  source  location  argument  will  be  dropped 
from  the  desired  signal  £(<,i).  In  addition,  for  the  remainder  of  this  section,  the 
dependence  of  the  calculations  required  to  compute  the  expectations  in  (4.3)  on  the 
assumed  value  of  the  environmental  parameter  ^  will  be  assumed  to  be  understood 
and  the  environmental  parameter  will  be  dropped  from  the  conditional  expectation 
notation. 


4.2  Ray  Approximation 

The  ray  treatment  of  the  effects  of  the  deterministic  and  stochastic  components  of 
the  sound  speed  structure  on  the  propagation  of  sound  is  developed  in  [30].  The  ray 
approximation  models  the  behavior  of  the  propagating  sound  field  by  assuming  that 
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the  direction  of  propagation  at  any  point  is  in  a  direction  normal  to  the  wavefront 
at  that  point.  This  approximation  is  only  valid  at  sufficiently  high  frequencies,  the 
definition  of  which  depends  on  the  scale  size  of  the  sound  speed  fluctuations  in  the 
ocean.  In  general,  the  smaller  the  scale  size,  the  higher  the  frequency  necessary  in 
order  for  this  approximation  to  be  valid. 

At  frequencies  for  which  the  ray  approximation  is  valid,  the  sound  field  is  modeled 
as  propagating  along  paths  which  obey  Snell’s  law.  The  sound  field  received  at 
any  point  can  then  be  expressed  as  the  sum  of  the  fields  which  propagated  along 
the  possibly  multiple  paths  which  obey  Snell’s  Law  and  connect  the  source  to  the 
receiver.  Here,  the  term  ray  will  denote  a  path  from  the  source  to  any  location  of 
interest,  which  obeys  Snell’s  Law  and  is  calculated  using  the  deterministic  sound 
speed  structure  ^(2).  The  term  path  will  be  used  to  refer  to  an  actual  path  followed 
by  the  sound  as  it  propagates  from  the  source  to  any  location  of  interest  calculated 
using  the  true  sound  speed  structure  C(z,  t).  A  path  results  from  the  perturbations 
to  a  ray  caused  by  the  fluctuating  ocean  processes  modeled  by 

The  paths  which  result  from  the  perturbations  to  a  particular  ray  lie  within  a 
ray  tube  surrounding  the  ray.  The  size  of  this  ray  tube  depends  upon  the  strength 
and  scale  length  of  the  random  process  t).  Following  the  convention,  but  not  the 
notation  of  [30],  the  received  signal  which  propagated  through  the  ray  tube  from 
the  source  to  the  array  sensor  can  be  expressed  as  the  product  of  the  signal  which 
would  have  propagated  along  the  ray  in  an  unperturbed  deterministic  ocean  and 
a  reduced  wavefunction  which  is  a  stochastic  process  dependent  on  ft(z,  t))  along 
the  propagation  paths  within  the  ray  tube.  That  is, 

Ni 

Xi{t)  =  Xi,(t), 

nsl 

and 

x.»(0  = 

Xi„(t)  is  the  signal  emitted  by  the  source  which  would  have  been  received  at  the 
array  sensor  via  the  ray  in  a  deterministic  ocean.  N,  is  the  number  of  rays 
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connecting  the  source  with  the  i*^  array  sensor.  is  the  reduced  wavefunction 

which  accounts  for  random  perturbations  in  Xi^{i)  due  to  the  random  perturbations 
of  the  sound  speed  structure  along  the  possibly  multiple  time-varying  paths  within 
the  ray  tube. 

The  ray  approximation  developed  in  [30]  treats  the  deterministic  ocean  as  a  linear 
time-invariant  system.  Therefore,  x,,(t)  is  the  product  of  and  a  complex  scaling 
constant  which  accounts  for  the  losses  and  the  time  delay  along  the  ray  between 
the  source  and  the  t‘*  sensor.  can  be  considered  a  modulating  signal  which 

accounts  for  the  Doppler  spread  introduced  by  the  random  ocean  component. 

The  spatial/temporal  correlation  functions  required  in  (4.3)  can  be  expressed  as 

E[ii(»A.)i:(o)  I  /.]  =  i;  £  x,-.(» A,)  i:„(o)  \  /j.  (4.4) 

nsl  ni=l 

The  expectation  operator  has  been  removed  from  the  first  two  factors  of  each  term 
in  (4.4)  because  they  are  not  stochastic  processes.  These  factors  can  be  evaluated  by 
calculating  the  delay  and  attenuation  introduced  along  each  ray  by  the  horizontadly- 
stratified  deterministic  ocean.  The  second  factor  in  each  term  can  be  evaluated  by 
calculating  the  statistics  of  the  fluctuations  in  x,„(t)  caused  by  the  random  ocean 
component  n{z,t)  along  the  n‘*  ray. 

The  calculation  of  x,„(sA()  required  in  (4.4)  is  conceptually  straightforward.  The 
only  rays  which  are  considered  are  those  which  are  fully  refracted  within  the  ocean 
(i.e., those  which  do  not  have  surface  or  bottom  interaction).  Therefore,  the  rays 
considered  are  those  whose  angle  with  respect  to  the  horizontal  is  always  furly  small 
(the  cutofl*  of  15‘’  is  used  in  [30]). 

For  long-range  propagation,  the  exclusion  of  bottom-reflected  rays  is  reasonable, 
because  the  high  losses  which  usually  occur  during  bottom  interaction  prevent  the 
rays  having  such  interaction  from  contributing  much  energy  to  the  received  signal. 
A  slight  modification  to  the  techniques  developed  in  [30],  which  makes  possible  the 
inclusion  of  surface-reflected  rays,  is  presented  at  the  end  of  this  section. 

Given  the  small-angle  assumption,  Snell’s  Law  relating  the  depth  of  the  ray 
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connecting  the  source  to  the  sensor  at  a  range  r  from  the  source  is 

1  +  U.Mr)) 

2ri^(0)  =  ^toureei  and 

|,=o  =  »i.(0). 

0i„(O)  is  the  initial  angle  of  the  ray  with  respect  to  the  horizontal,  z,ouree  is  the  source 
depth,  and  Zi„(r)  is  the  depth  of  the  n*^  ray  at  the  range  r.  If  the  vertical  separation 
of  the  source  and  the  array  sensor  is  small  compared  to  their  horizontal  separation 
Ri,  the  acoustic  path  length  along  the  n**  ray  can  be  reasonably  approximated  by 
Ri  +  where 

and  is  called  the  eikonal.  Then,  i,„(sA|)  can  be  expressed  as 

x<„(sA0  =  (4,5) 

where  ko  =  2x/o/C'o  is  the  nominal  wavenumber  of  the  signal  and  Ki„  is  the  normal¬ 
ization  factor  mentioned  on  page  83  and  discussed  in  Part  IV  of  [30].  This  calculation 
can  be  carried  out  for  all  rays  connecting  the  source  to  each  of  the  array  sensors. 
Here,  the  signal  attenuation  due  to  absorption  by  the  medium  has  been  ignored.  If 
the  difference  in  the  absorption  along  different  rays  may  be  significant,  an  absorption 
factor  can  be  included  in  (4.5). 

The  calculation  of  the  second-order  statistics  of  ^,„(sAt),  which  are  required  to 
evaluate  (4.4),  is  much  more  involved.  To  begin,  several  parameters  used  to  charac¬ 
terize  the  behavior  of  the  medium  within  each  ray  tube  are  introduced.  The  first  is 
the  Fresnel-zone  radius,  Rpif"),  at  the  horizontal  distance  r  from  the  source.  Here,  the 
indices  t  and  n,  denoting  the  endpoint  of  the  ray  and  the  ray  number,  are  dropped. 
Intuitively,  Rfi^)  is  a  measure  of  the  radius  of  a  ray  tube  through  which  a  path  could 
pass  without  having  its  acoustic  path  length  differ  from  the  acoustics  path  length 
of  the  ray  by  more  than  half  a  wavelength.  More  precisely,  Rpir)  is  defined  as  the 
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maximum  distance  such  that  for  any  point  =  (^o,r)  for  which  \zo  —  Zi„(r)\  <  /ip(r), 
the  following  is  condition  holds.  As  the  point  p  is  moved  from  to  po,  let 

the  ray  from  the  source  to  the  array  sensor  be  continuously  deformed  into  two 
rays,  one  from  the  source  to  p  and  the  other  from  p  to  the  array  sensor.  Then,  the 
sum  of  the  acoustic  path  lengths  of  the  two  rays  must  always  differ  from  the  acoustic 
path  length  of  the  n*^  ray  by  less  than  half  a  wavelength. 

The  next  parameter  is  a  measure  of  the  strength  of  the  sound  speed  fluctuations 
modeled  by  /i(r,r,<)  along  any  ray  where,  as  mentioned  before,  the  location  parame¬ 
ters  r  and  z  denote  the  horizontal  range  from  the  source  and  the  depth  in  a  vertical 
plane  containing  the  ray  of  interest.  Let  71„(r)  =  1  +  3  Then,  using 

the  small-angle  approximation  made  possible  by  the  exclusion  of  the  rays  which  are 
not  fully  refracted,  this  strength  parameter  (not  to  be  confused  with  the  set  of 
environmental  conditions  $  defined  earlier)  is  defined  as 


7i„(r)/i(r,z,„(r) 


where  the  dependence  of  on  /<,  is  not  shown  in  the  notation.  For  the  cases 
where  the  signal  propagates  through  the  ray  tube  via  only  a  single  path,  this  strength 
parameter  is  the  mean  squared  phase  fluctuation  in  x,’„(t). 

The  final  parameter  is  a  measure  of  the  diffraction  caused  by  the  spatial  extent  of 
sound  speed  fluctuations  along  any  ray.  This  diffraction  parameter  A,„  is  a  weighted 
average  of  along  the  ray  and  can  be  calculated  as  described  in  Chapter  7  of 
[30].  Here,  Bf  is  the  Fresnel-zone  radius  defined  earlier  and  L  is  the  scale  length  of 
the  stochastic  process  p.  For  large  A,„,  the  sound  field  propagating  within  a  ray  tube 
of  radius  Bjr  may  see  very  different  sound  speed  structures  within  any  cross-section 
of  the  tube.  The  sound  field  will  be  diffracted,  resulting  in  many  paths  within  the 
ray  tube.  For  small  A,,  and  small  the  sound  field  propagating  within  a  ray  tube 
of  radius  Rf  will  pass  through  roughly  the  same  sound  speed  structure  within  any 
cross-section  of  the  tube  and  will  propagate  along  only  a  single  path  within  the  tube. 

With  these  dofomed  parameters,  the  space  is  divided  into  three  regions. 


The  first,  referred  to  as  the  saturated  region,  is  defined  as  the  region  where  >  1 

and  >  1.  In  this  region,  the  sound  speed  perturbations  are  of  sufScient  strength 
and  with  a  sufficiently  small  scale  length  so  that  the  acoustic  field  propagating  in  the 
ray  tube  follows  multiple  paths,  called  micromultipaths.  In  addition,  the  micromulti¬ 
paths  are  spread  over  a  sufficiently  large  area  so  that  the  sound  speed  perturbations 
along  each  of  the  different  micromultipaths  are  uncorrelated.  The  second  region, 
referred  to  as  the  unsaturated  region,  is  defind  as  the  region  where  <  1  or 

<  1.  In  this  region,  the  scale  length  of  the  sound  speed  perturbations  is  large 
enough  or  the  strength  of  these  perturbations  is  small  enough  so  that  the  acous¬ 
tic  field  continues  to  propagate  along  a  single  path  within  the  ray  tube.  The  final 
region,  referred  to  as  the  partially-saturated  region,  is  defined  as  the  region  where 
>  1  and  Aj„$i„  <  1.  In  this  region,  the  sound  field  follows  several  different 
micromultipaths  along  which  the  sound  speed  perturbations  are  correlated. 

The  reduced  wavefunction  for  any  particular  ray  can  be  expressed  as 

=  y  d(paths)[e"'^ 


where  the  first  integral  is  over  all  micromultipaths  in  the  ray  tube,  2p»th(^)  is  the 
depth  of  the  particular  micromultipath  at  the  horizontal  range  r  from  the  source,  and 
<p«th(»’)  is  the  time  at  which  the  sound  field  received  at  time  t  at  the  sensor  via 
the  particular  multipath  would  have  been  at  a  horizontal  range  r  from  the  source. 
For  conceptual  and  notational  simplicity,  it  has  been  assumed  here  that  the  paths 
resulting  from  perturbation  to  a  particular  ray  remain  in  the  vertical  plane  containing 
that  ray.  The  second  integral  has  the  interpretation  of  the  phase  difference  between 
x,„(t)  and  the  signal  received  at  the  i*^  array  sensor  via  the  particular  path  over  which 
the  integration  is  carried  out. 

In  the  unsaturated  region,  the  reduced  wavefunction  accounts  for  the  random 
phase  perturbation  to  the  received  signal  resulting  from  the  perturbation  to  the  ray 
which  yields  a  single  path  within  the  ray  tube.  In  the  partially-saturated  and  sat¬ 
urated  regions,  the  reduced  wavefunction  has  the  interpretation  of  representing  the 


random  phase  and  amplitude  perturbations  to  the  received  signal  resulting  from  the 
addition  of  the  signals  which  were  subject  to  possibly  uncorrelated  random  phase 
perturbations  as  they  propagated  along  the  multiple  paths  within  the  ray  tube.  Un¬ 
der  the  assumption  that  the  random  fluctuations  in  the  sound  speed  structure  do  not 
increase  or  decrease  the  energy  in  the  received  sound  field,  |  /«]  =  1. 

Assuming  that  the  phase  perturbations  introduced  along  the  individual  paths 
are  jointly  Gaussian  random  variables,  the  quantity  E[^,„(sAt)^J^(0)  [  /„]  can  be 
expressed  as 

D{in^km,sAt^fo)  is  the  phase-structure  function  for  the  spatial  separation  between 
the  and  array  sensors  and  the  temporal  separation  of  sA«.  D{in,km,sAt,fo) 
can  be  interpreted  as  the  variance  of  the  phase  difference  between  signals  received  at 
different  points  in  space  and  time  caused  by  the  random  fluctuations  in  the  sound 
speed  structure  of  the  ocean.  D{in,km,sAufo)  is  defined  as 

(4.6) 

e[(/^‘  i;.(r)Mr,Ji.(r),(.-.(r))rfr  -  n.(r)Kr,j^(r),(..(r))  *)"]  . 

Here  t,„(r)  is  the  time  at  which  the  signal  received  at  the  array  sensor  via  the 
ray  at  the  time  t  =  sAj  would  have  been  at  the  range  r  from  the  source.  Likewise, 
is  the  time  at  which  the  signal  received  at  the  array  sensor  via  the  ray 
at  the  time  <  =  0  would  have  been  at  the  range  r  from  the  source.  As  defined  earlier, 
^•n(r)  is  the  depth  of  the  ray  from  the  source  to  the  array  sensor  at  the  range 
r  from  the  source.  The  assumption  has  been  made  that  the  radius  of  any  ray  tube 
is  small  compared  to  the  scale  over  which  the  statistics  of  fi{z,  t)  undergo  significant 
change.  This  allows  the  statistics  of  V’CO  ^  calculated  using  the 

statistics  of  along  the  ray  within  that  tube. 

As  mentioned  earlier,  D{in,km,sAt,fo)  can  be  interpreted  as  the  variance  of  the 
phase  difference  between  signals  received  at  different  points  in  space  and  time  caused 
by  the  random  fluctuations  in  the  sound  speed  structure.  As  either  the  temporal 
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separation,  sAt,  or  the  average  spatial  separation  between  the  ray  to  the  t**  array 
sensor  and  the  m‘*  ray  to  the  ib**  array  sensor  increase,  the  correlation  between  the 
random  fluctuations  in  the  phases  of  and  is  reduced.  Therefore, 

as  either  the  temporal  or  the  average  spatial  separation  increases,  D(in,km^sAt,fo) 
approaches  a  limit  of  the  STim  of  the  variance  of  the  random  phase  fluctuations  in 
each  of  the  two  signals.  That  is,  D(tn,fcm>sAt,/o)  In  addition, 

D{in,kjn,sAufo)  is  roughly  proportional  to  and  Therefore,  as  either  of 

these  parameters  increases,  the  cross-correlation  between  ^.^(sAt)  and  goes 

to  zero. 

The  evaluation  of  (4.6)  can  be  handled  as  two  separate  cases.  In  the  first  case,  the 
ray  from  the  source  to  the  array  sensor  can  be  considered  a  simple  shifting  of 
the  ray  from  the  source  to  the  array  sensor.  The  rays  in  Figure  4-1  illustrate 
such  a  case.  In  this  case,  the  perturbations  introduced  along  each  of  the  two  rays  may 
or  may  not  be  correlated.  In  order  to  allow  for  the  possiblity  of  this  correlation,  the 
phase- structure  function  must  be  evaluated  as  described  in  Section  7.3  of  [30].  Using 
the  method  described  therein,  the  integration  of  the  statistics  of  /i(r,z(r),t(r))  along 
the  and  rays  is  approximated  by  an  integration  along  a  ray  midway  between 
the  two  rays.  In  the  second  case,  the  ray  from  the  source  to  the  array  sensor 
has  an  entirely  different  form  than  the  m*^  ray  from  the  source  to  the  k*^  array  sensor. 
The  rays  in  Figure  4-2  illustrate  such  a  case.  In  this  case,  the  perturbations  introduced 
along  each  of  the  two  rays  can  be  assumed  to  be  uncorrelated.  As  mentioned  in  the 
preceding  paragraph,  the  phase-structure  function  for  this  case  reduces  to 

With  appropriate  substitutions  made,  (4.3)  can  be  written  as 

nsl  msl 

I  s  ()c-^o{<»>*"-^*)] .  (4.7) 
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The  term  in  the  square  brackets  is  the  windowed  DTFT  of  evaluated 

at  the  frequency  (/  —  /<,). 

If  the  correlation  time  (or  time-constant)  of  the  random  sound  speed  fluctuations  is 
greater  than  the  segment  length  used  by  the  DTFT  ((M  —  1)A(),  then  the  temporal 
incoherence  between  the  signals  received  at  different  times  separated  by  less  than 
(M  —  l)At  can  be  ignored.  That  is,  defining  B  =  as  the  bandwidth  of  the 

first  stage  DTFT  of  the  array  processor  and  as  the  correlation  time  (or  time- 
constant)  of  /i,  the  temporal  incoherence  can  be  ignored  when  BT^  >  1.  In  this  case, 
D{kn,kmiSAufo)  «  D{kn,km,0,fo)  and  (4.7)  can  be  simplified  to 


Enii  Em‘=l  Ki„Kk„ei'‘o(S.’n-S,„)e-^D(iM.Jo)^ 


-  fo)AiM)\ 

V  sin^(7r(/- /o)Ai)  ) 

(4.8) 


The  calculation  of  the  strength  parameter,  the  diffraction  parameter,  and  the 
phase-structure  function  can  be  done  using  the  spectral  functions  of  the  internal- 
wave  field  and  the  tidal  variations.  If  other  sources  of  perturbation  to  the  sound 
speed  structure  are  to  be  considered,  the  spectral  function  must  ‘nclude  the  effects 
of  these  processes.  Measurements  of  the  amplitude  and  phase  fluctuations  of  a  signail 
propagating  within  a  single  ray  tube  have  indicated  that  the  phase  fluctuations  are 
less  than,  and  the  amplitude  fluctuations  are  greater  than,  those  predicted  by  ray- 
based  random  propagation  theory  [31]. 

It  is  important  to  note  that  only  the  components  of  the  temporal  spectrum  cor¬ 
responding  to  fluctuations  with  time  constants  of  on  the  order  of  or  smaller  than 
the  observation  interval  ((M  —  l)At)  should  be  included  in  these  calculations.  The 
processes  with  time  constants  much  greater  than  the  observation  interval  will  not 
show  strong  fluctuations  over  the  interval  and  are  more  appropriately  treated  as 
time-invariant  but  unknown  characteristics  of  the  medium. 

As  presented  thus  far,  the  ray  treatment  of  the  effects  of  the  deterministic  and 
stochastic  components  of  the  sound  speed  structure  of  the  ocean  on  the  propagation 
of  sound  considers  only  fully  refracted  rays.  As  mentioned  earlier,  bottom-reflected 
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rays  can  be  ignored  because  of  the  high  losses  which  usually  occur  during  bottom 
interaction.  However,  the  exclusion  of  surface-reflected  rays  which  have  no  bottom 
interaction  cannot  be  justified  on  this  basis.  Such  rays  will  be  those  emitted  by  sources 
which  are  much  closer  to  the  surface  than  to  the  bottom  and  for  which  the  initial 
angle  of  propagation  with  respect  to  the  horizontal  is  fairly  small.  The  following  minor 
modification,  exploiting  the  Lloyd  mirror  effect,  extends  the  treatment  to  handle  these 
rays  [40]. 

For  the  purpose  of  calculating  the  rays  from  a  source  at  depth  xr«o«ree  to  each 
array  sensor,  an  image  ocean  is  placed  on  top  of  the  actual  ocean  and  the  ocean 
surface  is  ignored.  Therefore,  letting  Zd  be  the  ocean  depth,  the  propagation  medium 
extends  from  Zd  to  —Zd  rather  than  from  0  to  Zd  where  2  =  0  is  the  actual  ocean 
surface.  The  sound  speed  profile  of  the  image  ocean  is  the  mirror  image  of  the  sound 
speed  profile  of  the  real  ocean  (i.e.,  ^(2)  =  ^(—2)).  In  addition,  an  image  source 
is  placed  at  the  depth  —Zgouret  ^<1  image  sensor  is  placed  at  the  corresponding 
image  position  of  the  real  sensor.  A  ray  is  then  traced  from  the  actual  source  to 
the  actual  sensor  with  an  initial  propagation  angle  of  B  and  possibly  passing  through 
both  the  actual  and  the  image  ocean.  A  corresponding  ray  is  traced  from  the  image 
source  to  the  image  sensor.  The  combination  of  the  portions  of  these  two  rays  which 
lie  in  the  actual  ocean  (shown  by  the  dotted  line  in  Figure  4-3)  is  the  actual  ray  from 
the  actual  source  to  the  actual  sensor  with  an  initial  propagation  angle  of  B.  This 
modification  allows  the  surface-reflected  rays,  the  corresponding  eikonals  (5i„),  and 
the  corresponding  deterministically-propagated  signals  (zi,(sAf))  to  be  calculated  by 
selecting  2i,(r)  to  follow  the  ray  contained  in  the  actual  ocean. 

An  additional  effect  which  must  be  considered  is  the  increased  perturbation  to 
the  reduced  wavefunction  caused  by  the  scattering  of  the  reflected  signal  from  the 
turbulent  sea  surface.  This  is  accomplished  by  modifying  the  definition  of  the  phase- 
structure  function  (4.6)  to  include  a  term  D„  which  is  proportional  to  the  number  of 
surface  reflections  and  the  strength  of  the  surface  turbulence  and  which  is  inversly- 
proportional  to  the  spatial  correlation  distance  and  the  temporal  correlation  time 
of  the  surface  turbulence.  Experimental  results  [40]  indicate  that,  for  moderate  and 


long-range  propagation  problems,  this  term  is  much  smaller  than  the  term  accounting 
for  the  signal  fluctuations  induced  by  the  internal  wave  field  and  can  be  safely  ignored. 


4.3  Normal  Mode  Approximation 

The  normal  mode  treatment  of  the  effects  of  the  deterministic  and  stochastic  compo¬ 
nents  of  the  sound  speed  structure  on  the  propagation  of  sound  is  developed  in  [32]. 
The  normal  mode  approximation  models  the  behavior  of  the  propagating  sound  field 
by  asuming  that  the  field  is  the  sum  of  horizontally-propagating  vertical  standing 
waves  called  normal  modes.  The  standing  waves  are  the  result  of  the  constructive 
and  destructive  interference  between  successive  upward  or  downward-going  wavefronts 
which  have  been  reflected  from  the  sea  surface  and  bottom.  This  approximation  is 
valid  at  suflBciently  low  frequencies,  the  definition  of  which  depends  on  the  depth  of 
the  ocean.  In  an  ocean  with  a  depth  of  4000  meters,  the  frequency  at  which  the 
normal  mode  approximation  breaks  down  is  somewhere  between  200  Hz  and  500  Hz 
[33].  As  the  ocean  depth  decreases,  the  frequency  above  which  the  normal  mode 
approximaton  is  no  longer  valid  will  increase. 

E^h  normal  mode  is  characterized  by  its  modal  shape  <f>n{2)  (not  to  be  confused 
with  the  environmental  parameter  ^  defined  earlier)  and  its  horizontal  wavenumber 
is  the  vertical  standing  wave  set  up  by  the  reflections  of  the  propagating 
sound  field  from  the  sea  surface  and  bottom.  Assuming  the  ideal  case  where  the  sea 
surface  is  modeled  as  a  free  surface  and  the  sea  bottom  is  modeled  as  a  perfectly 
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rigid  bottom,  the  set  of  horizontal  wavenumbers  for  any  temporal  frequency  of  at 
which  the  modes  exist  will  be  a  discrete  set.  Therefore,  there  will  exist  only  a  finite 
number  of  modes  with  a  vertical  wavenumber  small  enough  so  that  the  horizontal 
wavenumber  is  a  real  number  {kn  =  where  k  is  the  wavenumber  and 

Jbv  is  the  vertical  wavenumber).  Therefore,  there  are  only  a  finite  number  of  modes 
for  which  the  vertical  standing  wave  propagates  in  the  horizontal  direction.  The 
evanescent  (non-propagating)  modes  decay  exponentially  in  range.  Due  to  this  rapid 
attenuation,  they  are  ignored  here. 

A  more  realistic  assumption  about  the  sea  bottom  is  that  it  is  not  perfectly  rigid 
and  is  characterized  by  a  finite  sound  speed  Ch  and  a  finite  density  pj.  Then,  for  wave- 
fronts  with  propagation  angles  greater  than  the  critical  angle  of  the  water/bottom 
interface  (here  the  propagation  angle  is  measured  with  respect  the  horizontal),  the 
sound  field  will  not  be  entirely  reflected  back  into  the  water  column  amd  some  energy 
will  propagate  into  the  bottom.  For  these  wavefronts,  perfect  cancellation  of  succes¬ 
sive  bottom  reflections  by  destructive  interference  is  not  possible.  Thus,  a  continuum 
of  horizontal  wavenumbers  in  the  region  kn  <  kcosOc  will  correspond  to  propagating 
modes.  These  modes  are  referred  to  as  the  continuous  spectrum.  [34] 

For  wavefronts  with  horizontal  wavenumbers  in  the  region  kij  >  kcosBc,  the  mag¬ 
nitude  of  the  bottom  reflection  coefficient  will  be  one,  and  perfect  cancellation  will 
occur.  Therefore,  in  this  region  the  set  of  horizontal  wavenumbers  corresponding  to 
propagating  modes  will  be  a  finite  discrete  set.  These  modes  are  referred  to  as  the 
discrete  spectrum. 

Due  to  the  leakage  of  the  modes  in  the  continuous  spectrum  into  the  bottom, 
these  modes  are  attenuated  rap  y.  A  good  rule  of  thumb  is  that  at  ranges  greater 
than  3  or  4  times  the  ocean  depth,  the  contribution  of  the  continuous  spectrum  to 
the  propagating  sound  field  can  be  ignored  [39].  Therefore,  it  will  be  assumed  that 
the  desired  signal  received  by  the  array  sensors  consists  of  only  propagating  modes 
in  the  discrete  spectrum. 

Following  the  convention  in  [32],  it  will  be  assumed  that  all  array  sensors  lie  in 
a  vertical  plane  which  also  contains  the  source,  and  the  signal  reaching  any  of  these 
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sensors  will  be  assumed  to  have  propagated  within  this  plane.  Therefore,  the  spatial 
coordinate  system  used  will  be  the  same  as  that  used  in  Section  4.2  with  z  representing 
depth  and  r  representing  range  from  the  source.  z,o^tc*  will  denote  the  depth  of  the 
source  and  the  pjur  (i2i,  Zi)  will  denote  the  position  of  the  array  sensor. 

As  mentioned  earlier,  the  propagating  sound  field  is  considered  to  be  the  finite 
sum  of  propagating  modes.  Each  propagating  mode  is  expressed  as  the  product  of 
two  factors.  The  first  accounts  for  the  effect  which  the  deterministic  ocean  has  on 
the  propagating  modes.  The  second  factor,  the  reduced  wavefunction,  is  a  stochastic 
process  which  accounts  for  the  effects  which  the  random  ocean  has  on  the  propagating 
modes.  Therefore,  the  sound  field  received  at  the  t**  array  sensor  can  be  expressed  as 

M 

ns=l 

where 

”  ®»n(0  V’tnCO* 

is,  to  within  a  scale  factor,  the  mode  of  the  signal  emitted  by  the  source 
which  would  have  been  received  at  the  t‘*  array  sensor  in  a  deterministic  ocean.  M 
is  the  number  of  propagating  modes  in  the  discrete  spectrum.  is  the  reduced 

wavefunction  which  accounts  for  the  amplitude  of  the  excitation  of  the  mode  by 
the  source,  as  well  as  the  random  perturbations  in  Xj„(<)  due  to  the  perturbations 
of  the  sound  speed  structure  (/i(r,r,<))  in  the  region  between  the  source  and  the 
array  sensor.  The  development  in  [32]  assumes  that  the  array  sensors  lie  at  a 
common  range  from  the  source.  This  restriction  is  not  a  problem  when  computing 
the  products  involving  x{t)  at  two  different  ranges,  but  is  restrictive  when  computing 
the  cross-correlation  of  the  reduced  wavefunction  at  two  different  ranges.  Therefore, 
the  normal  mode  approximation  presented  here  is  applicable  for  computing  the  signal 
replica  vector  for  only  vertical  arrays  (i.e.,  R  =  Ri  —  Rj  Vt,  j). 

Using  the  same  notation  as  in  Section  4.2,  the  spatial /temporal  correlation  func- 
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tions  required  in  (4.3)  can  be  expressed  as 


E1x,(3A.)i:(0)  I  /,]  =  E  L  *:.(»)  El(i,.(jA,)«„(0)  I  /.I.  (4.9) 

nsl  msl 

The  calculation  of  Xj„(sAt)  is  strsightforward.  At  ranges  where  R  is  sufficiently 
large  so  that  the  argument  kjf^R  justifies  the  use  the  large  argument  approximation 
of  the  Hankel  function  of  the  second  kind  [5],  Xi„(sAi)  can  be  expressed  as 

(4  10) 

Substituting  (4.10)  into  the  homogeneous  wave  equation  yields  the  following  differ¬ 
ential  equation  which  must  be  satisfied  by 


The  boundary  conditions  on  the  solutions  to  (4.11)  are  those  corresponding  to  the 
ideal  case  of  a  free  surface  and  a  perfectly  rigid  bottom.  These  correspond  to  zero 
pressure  at  the  surface  (^(0)  =  0)  and  zero  particle  displacement  at  the  bottom 
l»=w=  0  where  H  is  the  ocean  depth).  Given  these  boundary  conditions, 
the  solutions  to  (4.11)  will  be  orthogonal  functions  (i.e.,  ^„(z)^m(r)rfz  =  6nm 

where  6nm  is  the  Dirac  delta  function)  [4j.  The  M  solutions  to  (4.11)  corresponding 
to  the  discrete  propagating  modes  are  generally  arranged  in  descending  order  of  kfj^. 
Throughout  this  section,  will  denote  the  solutions  to  (4.11),  which  are  calculated 
using  ^(z)  as  the  sound  speed  structure. 

As  was  the  case  with  the  ray  approach  to  calculating  the  signal  replica  vector, 
the  calculation  of  the  second  order  statistics  of  the  reduced  wavefunction  is  much 
more  involved  than  the  calculation  of  The  fundamental  phenomenon  affecting 

the  reduced  wavefunction  is  as  modal  coupling.  To  understand  modal  coupling,  first 
consider  the  case  of  the  deterministic  horizontally-stratified  ocean  modeled  by  ^(z). 
In  this  case,  0,„(t)  =  <^n(^«»«ree)-  The  level  of  modal  excitation  depends  on  only  the 
mode  shape  and  the  source  depth.  At  the  range  Vo  the  propagating  sound  field  is 
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given  by 


ar(^o,^,^)  =  ^  (ijf„rc)  (4.12) 

n=l 

Then,  by  the  application  of  Huygens’  principle,  the  field  propagating  forward  from 
range  can  be  considered  to  have  been  excited  by  a  continuum  (in  depth)  of  point 
sources  at  range  r,,,  each  with  a  source  level  equal  to  x(r<„  z,  t).  That  is,  the  excitation 
level  of  the  m**  mode  in  the  field  propagating  forward  from  range  ro  can  be  expressed 
as 

[  <f>„,{z*)x{ro,z',t)dz'.  (4.13) 

Jo 

Substituting  (4.12)  into  (4.13)  and  using  the  orthogonal  relationship  between  the 
modal  shapes,  the  excitation  level  for  the  mode  reduces  to 

(4.14) 

Therefore,  the  mode  is  excited  at  range  r,,  if  and  only  if  the  m*^  mode  was 
originally  excited  by  the  source  and  it  is  excited  at  range  ro  with  an  amplitude  equal 
to  its  original  amplitude  reduced  by  an  amount  to  account  for  the  loss  due  to  geometric 
spreading.  Modal  propagation  which  obeys  this  model  is  referred  to  as  adiabatic  mode 
propagation,  implying  the  independent  propagation  of  the  individual  modes. 

Now,  assume  that  the  deterministic  sound  speed  profile  changes  at  the  range  r^. 
Then  the  modal  shapes  for  r  >  r*,,  denoted  by  ^{,(z),  will  be  different  from  those  for 
r  <  To.  Thus,  substituting  (4.12)  into  (4.13)  to  yield  an  expression  for  the  excitation 
level  of  the  mode  propagating  forward  from  the  range  Vo  results  in 

f”  (4,15) 

n=l  ■'0 

Since  the  modal  shapes  for  r  >  (^J„(z))  are  the  solutions  to  a  different  differential 

equation  than  the  modal  shapes  for  r  <  To  (^„(z)),  an  orthogonal  relationship  between 
these  mode  shapes  does  not  necessarily  exist;  therefore,  (4.15)  cannot  be  reduced  in 
the  fashion  which  yielded  (4.14).  Therefore,  in  the  situation  where  the  deterministic 
sound  speed  profile  changes  at  r  =  to,  the  excitation  level  of  the  m**  mode  propagating 
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forward  from  the  range  r*,  is  coupled  to  the  excitation  level  of  all  of  the  modes  at  the 
range  Vo.  Consistent  with  the  development  in  [32],  the  waves  traveling  backward  in 
range  resulting  from  the  reflection  off  of  the  discontinuity  in  the  sound  speed  profile 
at  the  range  will  be  ignored.  The  resulting  propagation  model  is  referred  to  as  the 
forward  scattering  approximation  to  coupled  mode  propagation. 

While  the  deterministic  sound  speed  profile  is  range-invariant,  the  actual  sound 
speed  structure  changes  with  range  as  a  result  of  the  effect  of  the  random  process 
fi{r,  z,  i).  The  modal  coupling  which  is  introduced  by  these  random  fluctuations  in 
the  sound  speed  structure  is  represented  by  the  reduced  wavefunction.  For  the  case 
where  ^(r,z,<)  1  (which  holds  for  almost  all  ocean  acoustics  problems  [40]),  the 

coupling  coefficient  between  the  m‘*  and  n‘*  modes  is 

Jo 

The  coupling  coefficient  is  a  function  only  of  range  and  time,  which  results  in  the 
reduced  wavefunction  being  a  function  only  of  range  and  time.  Under  the  reasonable 
assumption  [32]  that  is  small,  referred  to  as  the  quasistatic  approximation, 

the  reduced  wavefunction  evolves  according  to  the  following  differential  equation, 

n€{l,...,M},  (4.16) 

m=l 

with  the  initial  conditions  V’n(Ojf)  =  <l>n{z,ouTee)  Vn. 

By  previous  assumption,  the  ranges  Ri  =  Rj  =  R  and  as  a  result,  V’i„(0  = 
V>»(i2,,<)  =  V’n(^>0-  Therefore,  the  cross-correlation  required  in  (4.9)  can  be  ex¬ 
pressed  as 

E[*,(iA.)«.(0)  I  /.]  =  E[^,(«.»A,)C(H.0)  I  /.).  (4.17) 

The  modal  approximation  theory  developed  in  [32]  does  not  yield  an  expression 
which  accounts  for  the  temporal  decorrelation  of  the  reduced  wavefunction  at  a  par¬ 
ticular  range.  However,  as  described  in  Section  4.2,  when  BTf^  >  1,  (4.17)  can  be 
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closely  approximated  by 


I  /.] « EiMmrMo)  I  /.)•  (418) 

The  cross-correlation  function  of  the  solutions  to  the  differential  equation  (4.16) 
is  given  by 

<l)rM  0)  I  /.)  =  »V.(«)  <»«■  (4.19) 

The  assumption  which  underlies  this  expression  is  that,  at  the  ranges  for  which  the 
normal  mode  approximation  as  given  in  (4.10)  is  valid,  the  modal  phases  are  inco¬ 
herent  from  mode  to  mode.  Experimental  evidence  [41]  has  indicated  that  the  modal 
phases  may  be  partially  correlated  at  these  ranges.  Therefore,  the  validity  of  this 
assumption  may  be  in  doubt. 

lV„(r)  is  the  power  in  the  mode  at  range  r  and  can  be  shown  to  be  the  solution 
to  the  following  differential  equation. 

=  E  “«m(»n.(r)  -  H'„(r)),  (4.20) 

mat 

m^n 

with  the  initial  conditions  IV’n(O)  =  ^n(^»o«rce)  Vn.  Onm  is  the  non-negative  constant 
described  in  the  following  paragraph.  The  fact  that  Onm  is  non-negative  indicates 
that  the  effect  of  the  modal  coupling  will  be  to  transfer  power  from  the  modes  with 
greater  amounts  of  power  to  the  modes  with  lesser  amounts  power. 

Onm  is  a  modal  coupling  parameter  given  by 

=  ^E[l  i  r  A.m(r)e->'**--‘''-''<(r  1=],  (4.21) 

where  e  is  the  small  parameter  described  on  page  358  of  [32].  c  is  roughly  equal  to 
LR  where  L  is  the  scale  length  in  range  for  changes  in  the  random  process  ^  and 
R  is  the  scale  length  in  range  for  changes  in  the  random  process  Rnm-  Under  the 
assumption  that  /i  is  a  wide-sense  stationary  random  process  in  range  (i.e.,  spatially 
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homogeneous  in  range),  (4.21)  can  be  expressed  as 

Here, 

F{k,z,z')  ^  r  E[M{A„z'J)^(0,z,t)U->^dA, 

oo 

is  the  horizontal  wavenumber/vertical  position  spectrum  of  the  random  process  fi  and 
can  be  determined  from  the  spectrum  of  the  internal  waves,  tidal  oscillations,  and 
other  sources  of  significant  perturbations  in  the  sound  speed  structure  of  the  ocean; 
and  A'  is  a  constant  which  is  roughly  equal  to  As  was  the  case  in  Section  4.2,  it  is 
important  to  note  that  only  the  components  of  the  temporal  spectrum  corresponding 
to  fluctuations  with  time  constants  of  on  the  order  of  or  smaller  than  the  observation 
interval  ((M  —  l)At)  should  be  included  in  these  calculations. 

Combining  (4.19),  (4.18),  (4.10),  (4.9),  and  (4.3)  yields  the  following  expression 
for  Sikif)  given  the  deterministic  source 


)-'Mzi)Mzi.)W4R) 

(4.22) 


4.4  Numerical  Solution  of  the  Wave  Equation 

The  numerical  solution  of  the  wave  equation  does  not  require  approximation  to  the 
wave  equation  but  can  handle  the  dependence  of  the  signal  replica  vector  on  only 
^(z).  The  solution  is  the  signal  as  received  at  the  array  sensor  assuming  that 
the  source  emits  the  deterministic  complex  exponential  and  that  the  ocean  is  a 
deterministic  horizontally-stratified  medium.  Assuming  that  the  ocean  is  also  a  linear 
acoustic  medium,  the  solution  to  the  wave  equation  can  be  expressed  as 

Ii(()  = 
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where  i,(/o)  is  the  transfer  function  of  the  ocean  between  the  source  and  the 
array  sensor.  With  these  assumptions,  the  expression  for  the  spatial/temporal  cross¬ 
correlation  function  of  the  desired  signal  reduces  to 

Eli..(sA.)i:(0)  I  /.)  =  ii(/.)  *:(/.).  (4.23) 


Substituting  (4.23)  into  (4.3)  and  evaluating  the  DTFT  of  the  triangle  function  yields 


Eix,(/,i)x;(/,i)  1  (4.24) 


Efficient  algorithms  such  as  the  SAFARI  program  [29]  are  available  to  numerically 
compute  solutions  to  the  wave  equation.  Since  this  approach  cannot  take  into  account 
the  effect  of  the  random  perturbations  to  the  sound  speed  structure  of  the  ocean,  it 
should  only  be  used  in  situations  where  the  temporal  coherence  interval  of  the  ocean 
is  longer  than  the  segment  length  used  by  the  first  stage  DTFT  of  the  array  processor 
and  where  perfect  spatial  coherence  can  be  assumed  across  the  aperture  of  the  array. 
The  first  of  these  two  conditions  is  roughly  equivelent  to  BTft  >  1. 
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Chapter  5 


Conclusions  and  Future  Work 


The  problem  of  matched  field  processing  without  precise  knowledge  of  the  charac¬ 
teristics  of  the  propagation  environment  has  been  addressed  herein.  Previous  work 
summarized  in  Section  1.3  has  documented  and  analyzed  the  performance  degreda- 
tion  which  both  adaptive  and  non-adaptive  matched  field  processors  suffer  when  a 
mismatch  exists  between  the  environmental  conditions  assumed  by  the  processor  and 
those  which  actually  exist.  A  new  algorithm  for  implementing  a  matched  field  proces¬ 
sor  which  is  capable  of  operating  with  only  approximate  environmental  information 
yet  which  offers  the  high  resolution  and  interference  rejection  capability  characteris¬ 
tic  of  adaptive  processors  has  been  developed  using  the  framework  of  minmax  signal 
processing. 

The  matched  field  processor  developed  consists  of  discrete-time  Fourier  TVans- 
form,  followed  by  a  linear  weight-and-sum  beamformer,  followed  by  a  module  which 
averages  the  magnitude  squared  of  the  output  of  the  beamformer.  This  output  of 
this  final  module  is  an  estimate  of  the  average  power  in  a  narrowband  signal  emit¬ 
ted  by  a  source  at  the  array  focal  point  as  received  at  a  particular  sensor  in  the 
array.  This  signal  is  referred  to  as  the  desired  signal  and  the  particular  sensor  is 
referred  to  as  the  reference  sensor.  The  weights  of  the  beamformer  are  choosen  to 
minimize  the  maximum  conditional  mean-squared  error  between  the  output  of  the 
beamformer  and  the  desired  signal.  The  error  is  conditioned  on  the  characteristics 
of  the  propagation  environment  and  the  maximum  is  evaluated  over  a  user  specified 
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range  of  environmental  conditions  over  which  the  processor  is  designed  to  operate. 
The  processor  developed  using  this  minmax  framework  achieves  good  performance 
over  a  range  of  environmental  conditions  and  does  not  require  precise  knowledge  of 
the  characteristics  of  the  propagation  environment. 

The  fundamental  theorem  upon  which  the  minmax  signal  processing  framework 
is  built  is  the  Characterization  Theorem.  This  theorem,  which  is  presented  in  Sec¬ 
tion  2.1,  provides  a  nice  physical  interpretation  of  the  necessary  and  sufficient  condi¬ 
tions  satisfied  by  the  solutions  to  a  large  class  of  minmax  approximation  problems. 
The  practical  importance  of  the  theorm  lies  in  the  fact  that  makes  possible  the  de¬ 
velopment  of  an  efficient  algorithm  to  solve  for  the  optimal  minmax  array  weights 
when  the  average  signal  power  is  initially  unknown.  First,  in  Section  2.2  it  is  used 
to  prove  that  the  solution  to  the  minmax  array  weight  problem  given  any  assumed 
average  signal  power  can  be  expressed  as  a  real  non-negative  constant  multiplied  by 
the  solution  to  the  minmax  array  weight  problem  given  any  other  non-zero  assumed 
average  signal  power.  Finally,  in  Section  2.3  it  is  used  to  prove  that  the  optimad  min¬ 
max  array  weights  are  equivalent  to  the  minimum  mean-squared  error  array  weights 
when  the  “least  favorable  pmf*  is  assigned  to  the  environmental  conditions.  This 
least  favorable  pmf  interpretation  of  the  optimal  minmax  array  weights  motivates 
the  development  of  an  efficient  algorithm  to  solve  for  this  particular  pmf  and  makes 
possible  the  efficient  solution  of  the  minmax  array  weight  problem  which  is  of  interest. 

The  Characterization  Theorem  has  two  other  important  consequences.  First, 
it  motivated  the  work  in  in  Section  2.4  in  which  the  form  of  the  optimal  minmax 
mean-squared  error  estimator  and  an  approach  to  developing  bounds  on  the  minmax 
mean-squared  error  (>erformance  achievable  by  any  estimator  are  derived.  Finally, 
in  Section  3.1  the  algorithm  derived  using  the  Characterization  Theorem  leads  to 
the  interpretation  of  the  Adptive  Minmax  Matched  Field  Processor  as  a  Two-Stage 
Minimum  Variance  Distortionless  Response  (MVDR)  Matched  Field  Processor.  This 
relates  the  minmax  processor  to  a  processor  whose  performance  characteristics  are 
well  understood,  improves  the  understanding  of  what  the  minmax  processor  does  and 
how  it  can  be  expected  to  perform,  and  motivates  two  small  modifications  to  the 
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minmax  processor. 

The  numerical  analysis  of  the  processor's  performance  shows  that  in  a  determin¬ 
istic  medium  (Subsections  3.2.1  and  3.2.2),  the  processor  is  able  to  maintain  good 
performance  in  the  presence  of  environmental  uncertainty.  However,  the  results  in 
Subsection  3.2.3  show  that  when  the  medium  in  randomly  time-variant,  the  per¬ 
formance  of  both  the  Adaptive  Minmax  Matched  Field  Processor  and  the  MVDR 
Matched  Field  Processor  suffers.  Part  of  this  performance  loss  can  be  atributed 
the  loss  of  signal  correlation  across  the  aperture  of  the  array  which  makes  it  diffi¬ 
cult  for  any  processor  which  incorporates  a  linear  beamformer  to  perform  effectively. 
However,  some  of  this  performance  degradation  is  attributed  to  the  fact  that  both 
processors  model  the  desired  signal  as  having  a  rank  one  cross-spectral  correlation 
matrix.  Therefore,  since  the  cross-spectral  correlation  matrix  has  a  rank  of  greater 
than  one  in  random  propagation  medium,  the  processors  are  not  correctly  matched 
to  the  signal  characteristics  in  this  case. 

In  applications  where  the  random  media  fluctuations  are  dominated  by  the  internal 
waves  propagating  in  the  ocean,  the  processor's  poor  performance  in  random  media 
may  not  be  a  significant  handicap.  This  can  be  observed  by  noting  that  the  spectrum 
of  the  internal  waves  is  dominated  by  components  below  around  3  cycles  per  hour 
[37,  38].  Therefore,  for  observation  intervals  on  the  order  of  several  seconds  the 
media  will  appear  to  be  effectively  time-invariant  but  unknown  during  any  observation 
interval.  In  these  cases,  the  sample  cro5s-spectr£d  correlation  matrix  will  have  a  low- 
rank  and  the  rank  one  model  will  be  applicable. 

Further  work  needs  to  be  done  to  develop  methods  appropriate  for  applications 
where  significant  ocean  variation  is  expected  over  a  single  observation  inverval.  Using 
the  notation  of  Chapter  4,  these  methods  must  be  able  to  deal  with  uncertainty  in 
the  deterministic  depth-dependent  sound  speed  structure  (C{z))  of  the  oceans  while 
accommodating  the  random  ocean  fluctuations  (Co/x(2,  t))  which  may  be  significant 
over  a  single  observation  interval. 
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Appendix  A 

Proofs  for  General  Minmax 
Problems 


The  following  two  definitions  will  be  used  in  the  proofs  in  Appendicies  A  and  B. 

Definition:  Open  Neighborhood  of  a  Set  Let  A  be  a  subset  of  the  metric 
space  5.  Then,  for  any  ^  >  0,  the  open  neighborhood  A  of  radius  ^  in  5  is 

B(A,S)  =  {a  €  S  I  Bco  €  A  s.t.  d(a«»o)  < 

where  d :  5  x  5  — »  R  is  the  distance  function  for  the  metric  space  S. 

Definition:  Closed  Neighborhood  of  a  Set  Let  A  be  a  subset  of  the  metric 
space  S.  Then,  for  any  ^  >  0,  the  closed  neighborhood  of  A  of  radius  8  in  5,  denoted 
by  ^(A,5),  is  the  closure  of  B{A,6)  in  S. 

A  number  of  theorems  taken  directly  from  [2]  will  be  used  in  the  proofs  in  Appen¬ 
dicies  A  and  B.  They  will  be  referred  to  by  their  number  in  [2]  and  are  stated  below 
without  proof. 

Theorem  2.19  Every  neighborhood  is  an  open  set. 

Theorem  2.23  A  set  E  is  open  if  and  only  if  its  complement  is  closed. 

Theorem  2.24(a)  For  any  collection  {Go}  of  open  sets,  Ua  Go,  is  open. 

Theorem  2.35  Closed  subsets  of  compact  sets  axe  compact. 

Corollary  to  Theorem  2.35  If  F  is  closed  and  K  is  compact,  then  F  H  F  is 
compact. 

Theorem  4.6  Suppose  X  and  Y  axe  metric  spaces,  F  C  A,  p  ^  E,  p  is  a 
limit  point  of  F,  and  /  maps  F  into  Y.  Then,  /  is  continuous  at  p  if  and  only  if 
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linix^P  f{x)  =  f{p). 

Corollary  to  Theorem  4.8  A  mapping  /  of  a  metric  space  X  into  a  metric  sp2ice 
Y  is  continuous  if  and  only  if  is  closed  in  X  for  every  closed  set  C  in  Y. 

Theorem  4.0  Let  /  and  g  be  complex  continuous  functions  on  a  metric  space  X. 
Then  f  +  g,  fg,  and  Jjg  are  continuous  on  X. 

Theorem  4.14  Suppose  /  is  a  continuous  mapping  of  a  compact  metric  space  X 
into  a  metric  space  Y.  Then  f{X)  is  compact. 

Theorem  4.16  Suppose  /  is  a  continuous  real  function  on  a  compact  metric  space 
X,  and 

M  =  sup  /(p),  m  =  inf  /(p). 

Then  there  exist  points  p,q  £  X  such  that  f{p)  —  M  and  f{q)  =  m. 

Theorem  4.19  Let  /  be  a  continuous  mapping  of  a  compact  metric  space  X  into 
a  metric  space  Y.  Then  /  is  uniformly  continuous  on  X. 

Theorem  5.10  If  /  is  a  real  continuous  function  on  [a,  b]  which  is  differentiable  in 
(a,  b),  then  there  is  a  point  x  €  (a,  b)  at  which  f(b)  —  /(a)  =  (i  —  a)f'{x). 

In  addition,  there  are  two  theorems  in  [3]  which  will  be  used  in  the  proofs  in 

Appendicies  A  and  B.  They  will  be  referred  to  by  name  and  are  stated  below  without 

proof. 

Theorem  of  Carath4odory  Let  A  be  a  subset  of  an  n-dimensional  linear  space. 
Every  point  in  the  convex  hull  of  A  (H  (A))  is  expressible  as  a  convex  linear  combi¬ 
nation  of  n  -f  1  or  fewer  elements  of  A. 

Theorem  on  Linear  Inequalities  Let  1/  be  a  compact  subset  of  A  necessary 
and  sufficient  condition  that  the  system  of  linear  inequalities  <u,  z>  >  0  Vu€l/ 
be  inconsistent  is  that  Q  €  {U). 

The  following  three  lenunas  will  be  used  in  later  proofs  of  theorems. 

Lemma  A.l  Let  A  be  a  subset  of  the  Euclidian  metric  space  B  be  a  compact 
subset  of  the  metric  space  F,  and  /:AxB— »Ilbea  continuous  function  on  A  and 
B.  Then 

A(a)=  mw  /(a,  6) 

is  a  continuous  function  on  A. 

Proof:  /(a,  b)  is  a  real  function  continuous  on  B.  Then,  by  Theorem  4.16,  for  each 
o  €  A,  36o  €  B  such  that  f{a,bo)  =  sup^^  /(o*^)  therefore  A(a)  =  f{a,bo). 

Assume  that  A(a)  is  not  continuous  on  A.  Then  Boq  €  A  and  co  >  0  such  that 
V6o  >  0,  3ai  €  A  such  that  d(ao,ai)  <  So  and  |  A(ao)  —  A(ai)  |  >  €q. 

Assume  that  A(ao)  >  A(ai).  If  this  is  not  the  case,  the  roles  of  Oq  and  ai  can 
be  switched  in  the  remainder  of  the  proof.  As  established  earlier,  36d  €  B  such  that 
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/(ao,6o)  =  A(ao).  By  definition,  /(ai,6o)  <  A(ai).  Therefore,  |  f{ao,bo)- f{ai,bo)  | 
^  Co* 

Therefore  3ao  €  A  and  Cq  >  0  such  that  V^o  >  0,  3ai  €  A  such  that  d(ao,  Oi)  < 
So  and  |  /(ao>ho)  —  /(oiifco)  |  >  «o*  Therefore,  /  is  not  continuous  at  cq  which 
contradicts  the  continuity  of  /  on  A.  Therefore,  the  assumption  must  be  wrong. 
Therefore,  A(a)  is  continuous  on  A.  m 

Lemma  A.2  Let  A  be  a  convex  subset  of  the  Euclidian  metric  space  E,  B  be  a 
compact  subset  of  the  metric  space  F,  and  f  :  Ax B  —*  fl  he  a,  convex  function  on  A 
a  continuous  function  on  B.  Then 


A(a)=  m^  /(a,fe) 

is  a  convex  function  on  A. 

Proof:  /(a,  b)  is  a  real  function  continuous  on  B.  Then,  by  Theorem  4.16,  for  each 
o  €  A,  3bo  €  B  such  that  f{a,bo)  =  supj^g  /(a,  6)  and  therefore  A(a)  =  f{a,bo). 

Choose  any  cq,  oi  €  A  such  that  oq  ^  ai.  Then  by  the  convexity  of  A  and  of  /  on 
A 


VA€(0,1],  V6€B,  /(Aao  +  (l-A)a„6)<A/(ao,6)  +  (l-A)/(a„6). 

By  definition,  V6€  B,  f{ao,b)  <  A(ao)  and  /(ai,6)  <  A(ai).  Therefore, 

VA  €  [0, 1],  V6  €  P,  /(Aflo  +  (1  -  A)ai,  6)  <  AA(ao)  +  (1  -  A)A(o,).  (A.l) 

As  established  earlier 

VA€(0,1]  36(A)  €Bs.t. 

/(Aoo  +  (1  -  A)ai,6(A))  =  A(Aao  +  (1  -  A)aj).  (A.2) 

Choosing  6  in  (A.l)  to  be  6(A)  and  substituting  (A.2)  into  (A.l)  yields 
VA  €  [0, 1]  A(Aao  +  (1  -  A)ai)  <  AA(ao)  +  (1  -  A)A(ai). 

Since  this  is  true  for  any  Oq,  ai  €  A,  A(a)  is  a  convex  function  on  A.  ■ 

Lemma  A.3  Let  X,  y,  and  Z  be  metric  spaces  with  distance  functions  dxi  ,  ), 
dyi  y  ),  and  dz{  ,  ),  respectively.  Suppose  A  C  X  and  B  C  C  C  Yy  and  that 
g  :  A  B  and  f  :  C  -*  Z  aie  continuous  functions.  Then,  f  o  g  :  A  —*  Z  is  a 
continuous  function. 

Proof:  Select  any  p  €  A  and  c  >  0.  Then,  by  the  continuity  of  /,  36o  >  0  such 
that  V6  €  By  dY{byg{p))  <  So  -*  dz{f{b)y  f{g{p)))  <  c.  By  the  continuity  of  g, 
36i  >  0  such  that  Vo  €  A,  d;r(a,p)  <  tfi  -♦  dy(p(a),p(p))  <  So.  Therefore,  3^i  >  0 
such  that  Vo  €  A,  dx{ayp)  <  Si  dz{f{g{a))y  f{g{p)))  <  c.  Therefore,  /  op  is  a 
continuous  function  on  A.  ■ 
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The  following  definition  of  the  gradient  operator  is  required  in  order  to  prove  the 

Minmax  Characterization  Theorem  which  follows  the  definition. 

Definition:  Gradient  Operator  Let  $  be  a  subset  of  a  metric  space  denoted 
by  r,  be  a  subset  of  a  Euclidian  metric  space  denoted  by  E,  e  :  x  $  — >  R  be 
a  continuous  function  on  both  W  and  $  for  which,  at  each  m  €  IV,  a,  directional 
derivative  with  respect  to  iv  can  be  defined  on  ^ .  Then  the  gradient  of  e  with  respect 
to  t£,  denoted  by  is  defined  as  any  function  :  W  -*  E  which  is 

continuous  on  W  and  $  ana  for  which  the  following  is  true. 

WdeE,yweW,\/le9  k  <V^e(w,^,d>, 

where  k  is  a  real  positive  constant  and  <  ,  >  denotes  the  inner  product  defined  on 
E. 

Theorem  1  Let  $  be  a  compact  set  contained  in  a  metric  space  denoted  by  F, 
VF  be  an  open  set  of  a  Euclidian  metric  space  denoted  by  E,  e  :  x  $  — »  E.  be 
a  continuous  function  on  both  W  and  $  for  which,  at  each  w  €  W,  a  directional 
derivative  with  respect  to  w  can  be  defined  on  and  gradient  of 

£  with  respect  to  tn.  Then  a  necessary  condition  for  €  Vv  to  oe  a  solution  to  the 
following  minmax  problem 

—  arg  min  max£(u>, 

is  that 

a  €  W  {{V^e  I  ^  €  Miw,)}). 

If,  in  addition,  e  is  a  convex  function  of  j£  and  W  is  a  convex  set,  this  condition  is 
a  necessary  and  sufficient  condition  for  €  W  to  be  the  solution  solution  to  the 
stated  minmax  problem. 

Proof:  c(uio,  0)  is  continuous  on  $  and  $  is  compact.  Therefore,  by  Theorem  4.16, 

3^  €  ^  such  that  e(u^,^)  =  sup^g*  =  A(uj<,).  Therefore,  M(2^)  is 

not  an  empty  set. 

is  continuous  on  $,  $  and  II  are  metric  spaces,  and  the  singleton  set 
{A(uJo)}  is  closed  in  R.  Therefore,  by  Theorem  4.8,  M{wo)  is  closed  in 

M{wg)  is  closed  in  $,  and  $  is  compact.  Therefore,  by  Theorem  2.35,  M{w^)  is 
compact. 

Vjj^£  is  a  continuous  function  on  9  and  M{w^)  is  compact.  Therefore,  by 

Theorem  4.14,  the  set  {Vjj,£  |  ^  €  M(uj<,))  is  compact. 

The  necessary  condition  portion  of  the  theorem  will  be  proven  first. 

Assume  that 

HiH  ({V^  (ffi,,#)  1 1  €  M{!S,))). 

I  ^  is  a  compact  subset  of  the  Euclidian  metric  space  E. 


149 


Therefore,  by  the  Theorem  on  Linear  Inequalities,  3d^E  such  that 

<  Vjj^e  >  <  0  (A.3) 

Let  Co  =  sup^^^^)  <  Vjj,e  >. 

Vjjj,e  (l£,^)  is  a  continuous  function  on  W  and  Therefore,  by  Theorem  4.9, 

<  Vjjj£  (t£,  ^ ,  d  >  is  a  continuous  real  function  on  W  and 

<  Vjut  (vioi£}id.  >  is  a  continuous  real  function  on  ^  and  M{xs^)  C  $  is  compact. 

Therefore,  by  Theorem  4.16,  Co  =  max^gA/(ty„)  <  Vjj^e  >•  Therefore,  (A.3) 

implies  that  Co  <  0. 

<  Vuj£  {^y^,d  >  is  a  continuous  real  function  on  $  and  $  is  compact.  There¬ 
fore,  by  Theorem  4.19,  <  Vj^e  >  is  a  uniformly  continuous  function  on 

<  (w^,  ,  d  >  is  a  uniformly  continuous  function  on  $  and  €  M(Wo)  Q  ^  , 

<  (tVoi^^d  >  <  Co  <  0.  Therefore,  by  the  definition  of  a  uniformly  continuous 
function  and  Theorem  4.6, 

3^0  >  0  s.t.  €  5(M(u2<,),^o)  n  $,  <  Vjj,£  >  <  Co/2  <  0. 

B{M{w^),6o)  is  closed  and  $  is  compact.  Therefore,  by  the  Coroll<ury  to  Theo¬ 
rem  2.35,  B{M{j^),6o)  n  $  is  compact. 

<  (!£,^),d  >  is  a  continuous  real  function  on  W  and  ^(M(uj<,),^o)  H  $  is 
compact.  Tnereiore,  by  Lemma  A.l, 

_  max  <  V,„£  (t£,^),d  > 
is  a  continuous  function  on  W.  Therefore, 

3^1  >  0  s.t.  Vj£  €  B({u).},5i)nlV,  max  <  fu;,  , d  >  <  co/4  <  0. 

4'£E{M(wJ,6o)n*  -  \  / 

The  crucial  result  of  the  first  portion  of  this  proof  can  therefore  be  summarized 
as, 

36i  >  0  s.t.  Vui  €  n  14^  and  '^££'B{M{w^),6o)  Ci  (A.4) 

<  >  <  co/4  <  0.  (A.5) 

By  Theorems  2.19  and  2.24(a),  B(M(i£<,),6o)  is  an  open  set.  Therefore,  by  The¬ 
orem  2.23,  the  set  {$  —  B{M{Wf,),So)}  is  closed  in  Therefore,  since  $  is  compact, 
by  Theorem  2.35,  —  B(M(u^),6o)}  is  compact. 


150 


Note  that  €  {$  —  ■5(A/^(|fJo)»^o)},  <  A(ifjo),  and  let 

£i  =  sup  e{jy„,^). 

f€{#-B(A/(ui,)A)} 

^iuLoi^  is  a  continuous  real  function  on  i  and  —  B{M(x^)fSo)}  is  compact. 
Therefore,  by  Theorem  4.16, 


In  addition,  since  £{uLoi^  is  sl  continuous  real  function  on  W,  by  Lemma  A.l, 

max  e(tr, 

is  a  continuous  function  on  W. 

Let  £2  =  (^(nio)  +  €i)/2.  Then  £1  <  £2  <  A(tn^,).  Therefore,  since 

max  e(ui, 

*€{i-S(M(w^),So)} 


is  a  continuous  function  on  W, 


3^2  >  0  s.t.  Vui€  B({jx2o}»^2)n  W,  max  £{w,<f>)  <  £2- 

♦€{*-B(Af(u>o)/o)} 

Then,  since  {$  —  J9(A/(uj<,),^o)}  S  {$  —  B(M(2fj^),5o)}>  the  crucial  result  from 
the  second  portion  of  this  proof  can  be  expressed  as 

3^2  >  0  s.t.  Vt£  €  B({tn^},62)  n  and  €  {^  —  .6(M(ujo),6o)}  H  $,  (A. 6) 

e(ui,4)  <  (3  <  A(u2^).  (A.7) 

W  is  open.  Therefore,  3^3  >  0  such  that  5(u^,53)  C  W.  Then  let  Aq  = 

min(5i,tf2»^3)/(2  |  ^  |)  >  0,  and  define  2x;(A)  =  u>o  +  Ad.  Then,  VA  6  [0,  Aq], 
j^(A)  €  B{w^,6i)  n  W  and  u;(A)  €  5(u^,^2)  H  W.  Therefore,  by  (A.4)  and  (A.5), 

A  €  [0,  Ao]  -»  €  ^(M(2Wc),6o)  n  $,  <  Vjj^£  (l£(A),^),d  >  <  £o/4  <  0. 

By  definition, 

de{w  +  Ad,  <^)  Tt  (  a\  j 

|a=o  —  h  <.  Vjj,£  \  \v,^,d  >, 

where  A;  is  a  real  positive  constant.  Therefore, 

A  €  [0,  Ao]  6  6q)  n  $,  -  <  fc£o/4  <  0.  (A.8) 

2i;(A)  is  a  continuous  function  on  R  and  £(i£,^)  is  a  continuous  function  on  W. 
Therefore,  by  Lemma  A.3,  £(t£(A),^)  is  a  continuous  function  on  R.  Therefore,  by 
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(A.8)  and  Theorem  5.10, 

€  B{M{vio),  So)  n  £(iii(Ao),^  <  eimo,^)  +  Aofcco/4. 

By  definition,  £(ui(Ao),^  <  A(wi,),  and  Ao,fc  >0  and  cq  <  0.  Therefore,  letting 
C3  =  A(iii,)  +  Aofcco/4, 

'^^€'B(M{!iia),So)n^,  £(w(Xo),£  <  £3  <  A(u2<,).  (A.9) 

Let  £4  =  max(c2,C3)  <  A(tij^).  Then,  since  VA  €  [0,  Ao],  t£(A)  €  J5(u>o,^2)  H  W^, 
(A.6),  (A. 7),  and  (A.9)  can  be  combined  to  yield 

V^€$,  £(i£(Ao),^)  <  £4  <  A(u^). 

Therefore,  A(t£(Ao))  <  A(uj^). 

Therefore,  is  not  a  minimizing  point  of  A(tii)  on  W.  Therefore,  a  necessary 
condition  for  to  be  a  solution  to  the  following  minmax  problem 


is  that 

Q.e'H  ({Vjj,£:  (m>,^  I  4  €  M{iv^)]). 

The  sufficient  condition  portion  of  the  theorem  will  now  be  proven. 

Assume  that  is  a  convex  set,  £{wL,£i  is  a  convex  function  on  W,  and  that 

Q.e'H  ({Vjj,£  I  ^  €  A/(u^)}). 

I  ^  is  a  compact  subset  of  the  Euclidian  metric  space  B. 

Therefore,  by  the  Theorem  on  Linear  Inequalities,  Vd  €  E,  3^  €  M{yio)  such  that 

<  V^£  (w^,^),d>  >0. 

Select  any  €  W.  Let  d  =  Wi  —  xHo  £  E  and  define  i£(A)  as  before.  Then 
Wq  =  t£(A)  |a=o  and  ujj  =  i£(A)  |a=i.  Then,  by  the  definition  of  the  gradient 

operator,  3^  €  M{w^)  such  that  |a=o  >  0. 

By  the  definition  of  convex  functions  and  convex  sets, 


VA€[0,1],  £(tii(A),^)  <  (1  -  A)e(uj^,^)  +  Ae(uii,^). 

This  can  be  rewritten  as 

VA€  [0,1],  £(r£(A),^)  <  e(u2<„^)  +  A(£{i£,,^)  -  £(ui,,^))- 

By  the  definition  of  the  directional  derivative 

5£(t£(A),4)  £(3e(A),^)  -  £(1^0,^)  ^  „ 

— ar^l»-<'=  ^  "■ 


(A.IO) 


(A.ll) 
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Combining  (A. 10)  and  (A. 11)  and  carrying  out  some  algebraic  manipulation  yields 
e(siii.^)  >  £(3£o>^)-  Since  ^  e(jSo,^)'=  A(iiJo).  Furthermore, 

A(tiJi)  >  £(j£i,^).  Therefore,  A(i£i)  >  A(jii,).  This  is  true  for  any  €  W. 
Therefore,  is  the  minimizing  point  of  A(ui)  on  W. 

Therefore,  if  e{w,^)  is  convex  on  W  and  is  a  convex  set,  then 

flew  ({V,,£  I  ^  €  mm.))) 

is  a  sufficient  condition  that  is  a  solution  to 

lSop<  =  arg 


Theorem  5  Let  £(g^  ^)  be  any  performance  measure  for  the  estimator  g  given  the 
environmental  condition  (f).  Let  $  =  {<f>U‘  •  •  ■,<f>K}  and  let  g  €  P  =  {£  €  I  £  ^  0  and 
be  any  pmf  assigned  to  <(>  on  Let  £(£,  g)  =  ^i)  and  let  /?(£)  be  any  global 

lower  bound  on  £(£,<?).  That  is, 

^(£)  <  min  £(£,<;). 

a:X-*K, 

Then,  ;?(£)  is  also  a  lower  bound  on  A(5).  That  is, 

A<,=  min  A(y)  >  ^(£). 

g-.X-»a\ 


Proof: 


K  K 

£(£></)  =  <  IIp<(mM£(y,^))  =  maxe(^,^)  =  A(y) 

«=i  «=i 

This  hold  true  Wg  :  X  —*  fi  and  V£  €  P-  Therefore, 


0{p)  <  min  £(£,<?)  <  min  A(y)  =  A^. 


This  holds  true  for  any  2^^  P.  Therefore, 

V£  €  P,  fiil)  <  Ao. 


Theorem  6  Let  $  =  P  =  {£  €  I  £  ^  0  and  £*£  =  1},  and 

Po  :  A"  X  P  — »  R  be  given  by 


9o{x,e) 


r.^1  Pi  Px|»(g  I  ^.)E[g  I  X,  <f>i] 
E.^i  Prp.|^(x  I  <^i)  ’ 


(A.12) 
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where  Px\4>{^  I  ^«)  the  conditional  pdf  or  pmf  of  the  observation  x  given  that  the 
environmental  parameter  ^  Let  £(5, 4)  =  E[(^  —  g(x)y  (  4]- 
Let  the  least  favorable  pmf  €  P  be  defined  as 


K 


hj  =  s(Po(x,£),^<)- 

t=i 


Then 


and  the  quantity 


=  arg  min  mM£(5,^), 


K 


tsl 


is  an  achievable  lower  bound  on 


A(i7)  =  max  e.{g,4)- 


Proof;  This  proof  closely  parallels  that  of  Theorem  4. 

Consider  ^  to  be  a  random  parameter  and  let  g  be  any  assigned  pmf.  Then 

K 

»sl 

is  the  mean  squared  estimation  error  for  the  estimator  g  and  algabraic  manipulation 
of  (A.  12)  shows  that  <7o(x,g)  =  E[5  |  x]  and  is  therefore  the  minimum  mean  squeired 
error  estimator  of  6  given  the  observation  x.  Therefore, 

:  A"  -♦  R  and  Vg  €  P,  e(g,yo(x,g))  <  £(g,5). 

Therefore,  £(g,  <7o(x,g))  is  a  global  lower  bound  on  £(£,</)  and,  by  Theorem  5, 

Vg€  P,  £(g,^o(x,p))  <  Ao=  min  A(^).  (A.13) 

y:X-R 

It  will  now  be  shown  that  £(gj^,i?o(x,g^^))  >  Recall  that 

e(£,Po(x,g))  =  E[(^  -Po(x,g))*  I  4i]- 

•si 


Therefore, 

de{b9o{x,2)) 

dpj 


El(« -  j.(i,e))'  I  i,] ■^'tp‘ 

•si 


dpj 


(A.14) 
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and 


(A.15) 


OVi  m 

Substituting  (A.15)  into  (A.14)  and  carrying  out  the  summation  yields 

Recall  that  ^o(*i£)  =  E{d  |  x]  is  the  minimum  mean  squared  error  estimator  of  6 
given  X.  Therefore, 

El(»  -  «.(*.  E))^^^]  =  E[(«  -  E(»  I  (A.17) 

Since  is  n  function  of  x  and  is  not  a  function  of  d,  the  application  of  the 


(A.18) 


orthogonality  principle  yields 

E[(d-E[d|xl)^^l  =  0. 

Substituting  (A.17)  and  (A.18)  into  (A. 16)  yields 

I  '^'1  =  £(s.(*.E). ■t’i)-  (A-19) 

Let  m  €  {1, ...  ,/^}  be  any  index  such  that  pif„  >  0  and  n  €  {{!,..., if}  -  {m}} 
be  any  other  index.  For  A  €  [0,p//„]  define  £(A)  as  follows:  Pi(A)  =  pj/,  for  i  € 
{{1,...,/^}  -  {m,n}},p,„(A)  =  p,/„  -  A,  and  Pn(A)  =  p//„  +  A. 

Using  the  chain  rule, 


de{piX),go{x,p{X)))  _  ^  deip{X),go(x,p{X)))  dpijX) 
dX  ^  5pi(A)  dX 

Substituting  (A. 19)  into  (A.20)  and  noting  that 


(A.20) 


_  n  W,-  C  //I  _  Irr,  -  1 

—  OVi  €  {{l»**.iif}  ~~  1>  and  1 


yields 


a4g(A),p4x.£(A)))  ^  (A.21) 


Note  that  by  the  definition  of 

VA  €  [0,p,/„],  £(Pc(a:,£(A)),£(A))  <  £(p«>(x,£,^),£,P; 
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and  by  the  definiton  of  £(A), 


Therefore, 


e(^o(x,£(A)),£(A))  |a=o=  e{9oix,2is)^2ij) 


0 


arg 


«(^a(x,£(A)),£(A)) 

A€l0,pi/„1 


Therefore, 

5£(p(A),5<,(a:,£(A)))  ,  ^  „ 

- - |a=o  <  0, 


(A.22) 


Substituting  (A.21)  into  (A.22)  and  noting  that  <7o(a;,£(A))  |a=o=  9o{x,2ij)  yields 


<  e(Po(2,£,y),^m).  (A.23) 

Since  (A.23)  holds  for  any  index  m  such  that  p//„  >  0  and  for  any  other  index  n,  it 
can  be  concluded  that 


P//m  >  0  e{go{x,2ij),<t>m)  =  A(i^^(x,£,^)). 

Using  this  result  and  noting  that  the  following  sum  can  be  carried  out  over  only  those 
indecies  for  which  £,y  >  0  and  that  £^^  =  1  yields 

K 

e(a/ii7o(x,£,p)  =  Y.PisA9o{.x,2ij),<l>i)  =  A((7o(x,£,^)). 

isl 

Therefore, 

Ao  =  ^mii^A(<7)  <  £^{go{x,2ij))  =  e(£,;,Po(x,£,p)-  (A.24) 

Combining  (A.13)  and  (A.24)  and  noting  that  (A. 13)  holds  for  any  pmf  yields 
Ao=  mi^A(5)  =  A(5„(x,£^^))  =  £(£,^,</o(a:,£,y)). 


Therefore, 


</o(x,£,^))  =  Y.Piu€{go{x,2ij),<t>i) 

t=l 

is  an  achievable  lower  bound  on  A(5)  and  this  bound  is  achieved  by  go{x,2jj)- 
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Appendix  B 

Proofs  Specific  to  the  Adaptive 
Minmeix  Array  Processor 


In  the  adaptive  minmax  array  processing  problem,  the  set  W  and  the  Euclidian 
metric  space  E  defined  in  Appendix  A  are  both  the  A-dimensional  complex  plane 
denoted  C^.  This  is  a  2A-dimensional  Euclidian  metric  space  with  the  inner  product 
of  Ml  £  €  define<l  as 

<  Ml  £  >  =  Real(M^£)i 

the  norm  of  m  €  defined  as 


Ul  =  <MiM>^ 

and  the  distance  between  m  €  and  v  €  defined  as 

d^''(£i£)  =  1  M  “  £  I  • 

For  the  error  function  ^  expressed  in  (2.11),  the  gradient  operator 

is  given  by  [28] 

V^£(m1i^)  =  S(/)ui  -  (B.l) 

Under  the  assumption  that  is  continuous  on  and  with  the  constant  k  =  2, 
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this  gradient  operator  meets  all  of  the  requirements  set  forth  in  the  definition  of  the 
gradient  operator  contained  in  Appendix  A. 

The  following  array  weight  characterization  theorem  sets  out  the  necessary  and 
stiificient  conditions  for  a  set  of  array  weights  to  be  the  solution  to  the  minmax  array 
weight  problem. 

Theorem  3  Let  $  be  a  compact  set  contuned  in  a  metric  space  and  £(/,  z,  be 
a  continuous  function  on  Then  a  sufficient  condition  for  to  be  a  solution  to  the 
following  minmax  problem 

is  that 

3J  >  0,  (B.2) 

and 

Slifta)  =  Minu),  (B.3) 

such  that 

!i  €«({($(/)!£.  -  ol^US)  I  i€JW(!a,))).  (B.4) 

A  necessary  condition  for  to  be  a  solution  to  the  following  minmax  problem 

<^l)  =  «-g  min  max  «(/,  z,  i£,  (t^), 

jj,€C 

is  that 

3J€  {l,...,2Ar  +  l}  (B.5) 

for  which  (B.3)  and  (B.4)  are  satisfied. 

Proof:  £(/,  z,  w, <7^)  is  a  continuous  function  of  w  and  £(/,  z, ^);  and  £(/,  z, ^)  is 

a  continuous  function  on  Therefore,  by  Lemma  A. 3,  c{f,z,w,^,  <r*)  is  a  continuous 
function  on  and  $.  $  is  compact  and  at  each  i£  €  a  directional  derivative  with 
respect  to  u;  can  be  defined  on  9.  In  addition,  S(/)  is  positive  definite  and  therefore 
£(/>!»  convex  on  C^.  is  a  convex  set.  Therefore,  by  Theorem  1  and 

the  definition  of  the  gradient  operator  at  the  beginning  of  this  appendix,  a  necessary 
and  sufficient  condition  for  to  be  a  solution  to  the  following  minmax  problem 

=  arg  min  m^£(/,i,Sii,^,<T*), 

JJ;gC 

is  that 

0 €  w  ({(§(/)k.  -  I  i e  Mfc)}).  (B.6) 

If  (B.6)  is  satisfied  on  a  countable  subset  of  then  it  is  also  satisfied  on  M(w^). 

Therefore,  replacing  Af(ij>^)  in  (B.6)  by  an  arbitrary  countable  subset  M{uLo)  and 


noting  that  (B.6)  is  a  sufficient  condition  yields  the  sufficient  conditions,  (B.2)  through 
(B.4),  for  to  t)e  the  desired  solution. 

is  a  2iV'dimensional  linear  space.  Therefore,  if  (B.6)  is  satisfied  on 
by  the  Theorem  of  Carath^dory  it  must  also  be  satisfied  on  a  subset  of  M(uv) 
containing  2N  +  1  or  fewer  elements.  Therefore,  noting  that  (B.6)  is  a  necessary 
condition  yields  the  necessary  conditions,  (B.5),  (B.3)  and  (B.4),  for  to  be  the 
desired  solution.  ■ 

Using  Theorem  (3),  the  following  theorem  can  be  proven. 

Theorem  2  Let  al  be  any  real  positive  number,  $  be  a  compact  set  contained  in 
a  metric  space,  be  a  continuous  function  on  and 

=  arg  min  m«e(/,2,ui,^,<r^). 

jj/cC  - 

Then  for  any  real  non-negative  <r^,  the  solution  to  the  problem 
aicptif, <y^)  =  arg  min  max e(/,  r,  w,  <f>, 

is  given  by 

Proof:  The  following  notation,  which  is  more  explicit  than  that  used  in  the  body 

of  this  thesis  and  in  the  proofs  of  the  other  theorems,  will  be  used  in  this  proof.  The 
extremal  value  will  be  denoted  by  A(/,i,iJi,cr^)  and  the  extremal  point  set  will  be 
denoted  by  A/(/,  5,,  u;,  a^)  where 

A(/,  1,  w,  <t’)  =  in^  £(/,!,  t£,  <t>,  o*) 

£€♦ 

and 

M{f,z,w,a^)  =  I  c(/,z,t£,^,a*)  =  A(/,z,uj,<t*)}. 

For  any  positive  real  <r|,  let 

=  arg  mm  maxe(/,i,ui,^,<T^). 

For  any  positive  real  o’,  define 

le(o’)  =  (oV<t’)  (B.7) 

By  Theorem  3, 

Q  €  W  ({(S(/)  -  <rl  £(/,i,^))  1  i  €  M(/,i,ieopt(/,l,<T’),o’)}). 

(B.8) 
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Substituting  (B.7)  into  (B.8)  yields 


a  €  W  ({(<T„V.T>)  (S(/)  I  i  6  <’1), <'.')))• 

This  implies  that 

a  e  K  ({(§(/)  -  <,»  £(/a.^))  I  i  €  M(/,j,i£^(/,l,<T2),aJ)}),  (B.9) 

Substituting  (B.7)  into  (2.11)  and  carrying  out  some  algebrdc  manipulation  yields 
the  following  expression  for  <i>t(r^)> 

Note  that  £(/,i,t£(<r^),^,<T^)  is  a  equal  to  multiplied  by  a 

positive  number  which  is  independent  of  ^  and  added  to  a  number  which  is  indepen¬ 
dent  of  <f>.  This  implies  that 


Af(/,l,t£(<^  ),<y  )  =  M{f,z,w^t{f,z,al),<Tl) 


(B.IO) 


Substituting  (B.IO)  into  (B.9)  yields 


Therefore,  1£(<t^)  satisfies  the  requirements  of  Theorem  3.  This  implies  that 


Therefore, 


t£(<T’)  =  arg  min  maxe(/,i,t£,^,<T^). 

iv€C’*  ♦€♦  ~ 


Theorem  4  Let  P  be  the  set  of  all  possible  pmfs  which  may  be  assigned  to  $  and 
define  the  least  favorable  pmf  as 

in^n  £(/,z,jii,P,<r’)  =  argm^e(/,z,uj^„„,(/,z,<7j,p),p,<T^) . 


Then 


(B.ll) 


l£cpt(A£><^!)=  arg  min  e(f,  z,  w, <r^)  =  lS^m,e(Ai,<^oyPif)  ^ 
jscC 


Proof:  From  (2.22),  £(/,!,  samm»e>E»<ro)  ca°  he  expressed  as 


«(/,!,  jemm.e,E,<r’)  =  ^  Pi  e(f,  I,  l^m,e(Eh 

fSl 
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Therefore, 


r  -  ...  .X  _2^  .  ^  _  ^g(/,  iamm»e(g)>  <^0) 

'  (B.12) 

Combining  (2.5)  and  (2.24)  yields 


K 


1=1 


K  K 


(<'1?  E  E  PI  s(/)-‘  £(/.!.£„)  p- 

tel  m=l 


Therefore, 

^PJ  tel 

Real(i‘(/,1,£.)  §(/)-•  i(/a,*p)). 


Noting  that  the  first  term  is  independent  of  the  index  i  and  carrying  out  the  summa¬ 
tion  over  i  in  (B.12)  yields 


^  p. - - — =1 — 

tel 


Therefore, 

1 _/ r  _  ...  f^\  A  — 2\  111 

- a - - -  =  e(/,l,!i!mm«(E)>2i’‘^o)- 

dpj  ^ 

Let  m  €  {1,  . . . ,  be  any  index  such  that  pij„  >  0  and  n  €  {{!,••  -  {m}} 

be  any  other  index.  For  A  €  [0,pij„]  define  £(A)  as  follows:  p<(A)  =  pi/^  for  i  € 
{{1,...,^}  -  {m,n}},  pm{>^)  =pif„  -  A,  and  pn(A)  =  pif„  +  A. 

Using  the  chain  rule. 


de(f,i,a„„„,Sf,>'),<rl)  _  ■^^de(f,z,yu„„,2{X),irl)  dpijX) 

~  h  5Pi(A)  8A  ■ 


a\ 


(B.14) 


Substituting  (B.13)  into  (B.14)  and  noting  that 


dpiW 

d\ 


=  0  Vt  €  {{!,..., A")  -  {m,n}}, 


5pm(A) 

d\ 


=  —1,  and 


dpn{\) 

dx 


=  1 


yields 

dX 


^(/>2.jl2mm»e(E('^))’^’  ^o)  ^(/ 4^4l42»nm»e(E(^))’^m’ 

(B.15) 


Note  that  by  the  definition  of  2^p 

VA  €  [0,p//„],  ^(/j^5l4Znim»e»2(A)j  ^o)> 

and  by  the  definiton  of  £(A), 

Therefore, 


0  =  arg  max 


Therefore, 


^g(/,l,tn,nm.e,g(A),<yo) 

dx 


|a=o  <  0.  (B.16) 

Substituting  (B.15)  into  (B.16)  and  noting  that  w„^,^(£(X))  |a=o=  mmmse(Pjf)  yields, 

£(/,!,  2iUnm»e(Ejf)^ln>^o)  <  ^(A  Ir  UlmmseiEj/)^  ^o)-  (B-l?) 

Since  (B.17)  holds  for  any  index  m  such  that  p//„  >  0  and  for  any  other  index  n,  it 
can  be  concluded  that 


Pi/m  >  0  -►  =  A{tn„,„„(p^P). 

Therefore 

Plfm  >  0  -*  <f>m€  M{w^^,^{f,Z,(Tl,p^^)). 

Using  the  notation  of  Corollary  1  to  Theorem  3,  the  desired  subset  of  the  extremal 
point  set  can  be  selected  as 

|  £//,  >  0}. 

Recall  from  (2.24)  that 

K 

=  <^o  S(/)*^  PlJ,  giA  1,  ^i)- 

i=l 

Since  all  environmental  conditions  for  which  the  pmf  is  non-zero  are  contained  in  the 
selected  subset  of  the  extremal  point  set,  this  summation  can  be  carried  out  over 
only  this  subset  Therefore,  and  M(tn„„„,e(/,z,<T^,£,^))  as  selected 

satisfy  the  sufficient  conditions  of  Corollary  1  to  Theorem  3.  Therefore, 

Si2opl(/i  ^o)  ~  J£mm»e(/>i)^o’£/^)’ 


Claim  1  M{f,z)  is  a  co-positive-plus  matrix. 
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Proof:  From  [24] 

Co-positive  matricies  are  (square)  matricies  M  such  that 

X  >  0  — »  x‘A/x  >  0. 

Co-positive-plus  matricies  are  co-positive  matricies  such  that 
X  >  0  and  x*Mx  =  0  -♦  (M  +  Af‘)x  =  0. 


, . .  The  class  of  co-positive-plus  matricies  also  includes  positive-semidefinite 
matricies, . . . 


Therefore,  it  is  sufficient  to  show  that  M(f,z)  is  a  positive-semidefinite  matrix. 
Recall  that 

r(/,j)  t  R*<j(g(/,2)'s(/)-'o(/,z)), 

that  T{f,z)  can  be  broken  into  the  following  components 


Tif,z)  = 


Tuif,z)  r(/,i) 


(B.18) 


and  that  S(/)  is  a  positive-definite  matrix.  Therefore,  T{f,  z)  is  a  positive-semidefinite 
matrix. 

Recall  that  M{f,z)  is  the  following  matrix 


where  the  minor 


M{f,z) 


0  -e‘ 

e  M{f,z) 


M{f,z)  =  f{f,z)  -}-  Ti,{/,z)ce*  -  cf‘(/,z)  -  f(/,i)  e‘.  (B.19) 


Select  any  vector  x  =  [xi,x‘]‘  €  Then 

x'Af(/,x)x  =  x*  M{f,z)  X. 


Substituting  (B.19)  into  this  expression  yields 

x!M{f,z)x  =  x‘Z’(/,i)x-2(c‘£)(i*(/,£)i)  +  rii(/,i)(c‘x)^  (B.20) 

Let  j/  =  [— x,x*]‘  €  R^.  Then,  since  T(f,z)  is  positive-semidefinite,  the 
quadratic  product  £‘r{/,i)£  is  non-negative.  Using  (B.18)  to  expand  this  quadratic 
product  yields 

£‘r(/,z)^  =  i*f{f,z)i-2{&*k)(L\f,l)x)  +  Tn{f,l)(&*S.y  >  0.  (B.21) 
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Note  that  the  righthand  side  of  (B.20)  equals  the  center  section  of  (B.21).  Therefore, 

x!  M if,  z)x  =  }!^T{f,z)r^  >  0. 

Since  this  is  true  for  any  vector  x  €  the  matrix  M{f,z)  is  positive-semidefinite. 
Therefore,  M{f,s)  is  a  co-positive-plus  matrix.  ■ 

Claim  2  The  set  of  equations  (B.22)  through  (B.24)  are  consistent. 


£  >  0,  and  3i  >  0  s.t. 

(B.22) 

u‘£  =  0  and 

(B.23) 

V  -  M(f,z)l  =  l[f,z) , 

(B.24) 

Proof:  Assume  that  the  set  of  equations  (B.22)  through  (B.24)  axe  inconsistent. 

Then,  since  (B.22)  through  (B.24)  are  simply  the  result  of  the  algebraic  manipulation 
of  (B.25)  through  (B.27),  (B.25)  through  (B.27)  must  also  be  inconsistent. 

>  0?  =  1,  3u  €  R,  and  Bn  >  0  s.t.  (B.25) 

=  0  and  (B.26) 

V  —  T{f,z)2^  +  ti  e  =  0.  (B.27) 

Therefore,  since  (B.25)  through  (B.27)  are  the  necessary  and  sufficient  conditions 
which  must  be  met  by  any  solution  to  the  constrained  quadratic  minimization  problem 
(B.28),  this  problem  cannot  have  a  solution. 

Eij  -  E*  T{f,z)^.  (B.28) 

However,  £*  T{f,^  £  is  a  continuous  function  of  £  and  the  set  P  =  {£  €  R^  1  £  > 
0  and  e*£  =  1}  is  compact.  Therefore,  by  Theroem  4.16,  3£^  G  P  such  that 

T{f,z)  =  inf  £*  T{f,z)  £  =  min  £<  T{f,z)  p. 

Therefore,  £^  is  a  solution  to  (B.28)  which  contradicts  the  statement  that  the  problem 
cannot  have  a  solution.  Therefore,  the  assumption  must  be  wrong. 

Therefore,  the  set  of  equations  (B.22)  through  (B.24)  are  consistent.  ■ 
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